Article code: 4499588
Journal code: 1319037
Publication year: 2006
English article: 10 pages, PDF
English title of the ISI article
Predicting rRNA-, RNA-, and DNA-binding proteins from primary structure with support vector machines
Related topics
Life Sciences and Biotechnology; Agricultural and Biological Sciences; Agricultural and Biological Sciences (General)
English abstract

In the post-genome era, predicting protein function is one of the most demanding tasks in bioinformatics. Machine learning methods such as support vector machines (SVMs) greatly help to improve the classification of protein function. In this work, we combined SVMs with protein sequence amino acid composition and associated physicochemical properties to predict nucleic-acid-binding proteins. We developed binary classifiers for rRNA-, RNA-, and DNA-binding proteins, which play an important role in the control of many cell processes. Each SVM predicts whether a protein belongs to the rRNA-, RNA-, or DNA-binding protein class. Self-consistency and jackknife tests were performed on protein data sets in which the sequence identity was <25%. Test results show that the accuracies of the rRNA-, RNA-, and DNA-binding SVM predictions are ∼84%, ∼78%, and ∼72%, respectively. Predictions were also performed on an ambiguous data set and a negative data set. The scores that the RNA- and DNA-binding SVM models assigned to proteins in the ambiguous data set were distributed around zero, while most proteins in the negative data set received negative scores from all three SVMs. These score distributions agree well with prior knowledge of those proteins and show the effectiveness of sequence-associated physicochemical properties in protein function prediction. The software is available from the author upon request.
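
To make the evaluation protocol in the abstract concrete, the following is a minimal sketch of an amino acid composition feature vector together with self-consistency and jackknife (leave-one-out) tests. It is not the authors' software. The classifier is a nearest-centroid stand-in rather than an SVM, the physicochemical descriptors are omitted, and the sequences, labels, and all function names are invented for illustration. The signed score is only loosely analogous to an SVM decision value: positive means "binding", negative means "non-binding".

```python
from collections import Counter
from math import dist

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def composition(sequence):
    """Return the 20-dimensional amino acid composition (fractions summing to 1)."""
    counts = Counter(sequence.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)  # non-standard letters are ignored
    if total == 0:
        raise ValueError("sequence contains no standard residues")
    return [counts[a] / total for a in AMINO_ACIDS]


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def train(features, labels):
    """Stand-in classifier (NOT an SVM): one centroid per class (+1 / -1)."""
    positives = [f for f, y in zip(features, labels) if y == 1]
    negatives = [f for f, y in zip(features, labels) if y == -1]
    return centroid(positives), centroid(negatives)


def score(model, feature):
    """Signed score: > 0 if the protein lies closer to the positive centroid."""
    positive_centroid, negative_centroid = model
    return dist(feature, negative_centroid) - dist(feature, positive_centroid)


def predict(model, feature):
    return 1 if score(model, feature) > 0 else -1


def self_consistency_accuracy(features, labels):
    """Train on the full data set and test on the same proteins."""
    model = train(features, labels)
    hits = sum(predict(model, f) == y for f, y in zip(features, labels))
    return hits / len(features)


def jackknife_accuracy(features, labels):
    """Hold out each protein once, train on the rest, and test on the held-out one."""
    hits = 0
    for i in range(len(features)):
        model = train(features[:i] + features[i + 1:], labels[:i] + labels[i + 1:])
        hits += predict(model, features[i]) == labels[i]
    return hits / len(features)


# Toy data, invented for illustration only (not real proteins).
sequences = [
    "KRKRKKRGRKAK", "RKKRRSKRKGRK", "KKRRKAKRRKGR",  # labeled +1 ("binding")
    "DEDEELDEDLEE", "EEDDLEDEADEL", "DELEDDEELEDA",  # labeled -1 ("non-binding")
]
labels = [1, 1, 1, -1, -1, -1]
features = [composition(s) for s in sequences]

print("self-consistency accuracy:", self_consistency_accuracy(features, labels))
print("jackknife accuracy:", jackknife_accuracy(features, labels))
```

Because the jackknife test never scores a protein with a model that saw it during training, it is the stricter of the two checks. Swapping the stand-in `train` and `score` functions for a real SVM implementation would leave the evaluation loop unchanged.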

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Theoretical Biology - Volume 240, Issue 2, 21 May 2006, Pages 175–184