Convergent random forest predictor: Methodology for predicting drug response from genome-scale data applied to anti-TNF response

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
2821327	1160941	2009	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Drug response prediction Classifiers - طبقه بندی ها

موضوعات مرتبط

علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک

پیش نمایش صفحه اول مقاله

Convergent random forest predictor: Methodology for predicting drug response from genome-scale data applied to anti-TNF response

چکیده انگلیسی

Biomarker development for prediction of patient response to therapy is one of the goals of molecular profiling of human tissues. Due to the large number of transcripts, relatively limited number of samples, and high variability of data, identification of predictive biomarkers is a challenge for data analysis. Furthermore, many genes may be responsible for drug response differences, but often only a few are sufficient for accurate prediction. Here we present an analysis approach, the Convergent Random Forest (CRF) method, for the identification of highly predictive biomarkers. The aim is to select from genome-wide expression data a small number of non-redundant biomarkers that could be developed into a simple and robust diagnostic tool. Our method combines the Random Forest classifier and gene expression clustering to rank and select a small number of predictive genes. We evaluated the CRF approach by analyzing four different data sets. The first set contains transcript profiles of whole blood from rheumatoid arthritis patients, collected before anti-TNF treatment, and their subsequent response to the therapy. In this set, CRF identified 8 transcripts predicting response to therapy with 89% accuracy. We also applied the CRF to the analysis of three previously published expression data sets. For all sets, we have compared the CRF and recursive support vector machines (RSVM) approaches to feature selection and classification. In all cases the CRF selects much smaller number of features, five to eight genes, while achieving similar or better performance on both training and independent testing sets of data. For both methods performance estimates using cross-validation is similar to performance on independent samples. The method has been implemented in R and is available from the authors upon request: Jadwiga.Bienkowska@biogenidec.com.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Genomics - Volume 94, Issue 6, December 2009, Pages 423–432

نویسندگان

Jadwiga R. Bienkowska, Gul S. Dalgin, Franak Batliwalla, Normand Allaire, Ronenn Roubenoff, Peter K. Gregersen, John P. Carulli,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Convergent random forest predictor: Methodology for predicting drug response from genome-scale data applied to anti-TNF response

دسترسی سریع

ارتباط

English Website