کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2820945 1160909 2008 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
EnsemPro: An ensemble approach to predicting transcription start sites in human genomic DNA sequences
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک
پیش نمایش صفحه اول مقاله
EnsemPro: An ensemble approach to predicting transcription start sites in human genomic DNA sequences
چکیده انگلیسی

Although several computational methods have been developed to identify transcription start sites (TSSs)/promoters, the computational prediction still needs improvement. Due to low performance, the promoter prediction programs can provide misleading results in functional genomic studies. To improve the prediction accuracy, we propose the use of an ensemble approach, EnsemPro (Ensemble Promoter), which combines the prediction results of the existing promoter predictors. We schematically compared the prediction performance of the currently available promoter prediction programs in an identical evaluating environment, and the results served as a guide for choosing the combined predictors. We applied three representative ensemble schemes—the majority voting, the weighted voting, and the Bayesian approach—for the TSS prediction of hundreds of human genomic sequences. EnsemPro identified the TSSs more precisely than other combining methods as well as the currently available individual predictor programs. The source code of EnsemPro is available on request from the authors.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Genomics - Volume 91, Issue 3, March 2008, Pages 259–266
نویسندگان
, , , ,