Article ID Journal Published Year Pages File Type
2078878 Chinese Journal of Biotechnology 2017 9 Pages PDF
Abstract
The comparative sequence analysis is the most reliable method for RNA secondary structure prediction, and many algorithms based on it have been developed in last several decades. This paper considers RNA structure prediction as a 2-classes classification problem: given a sequence alignment, to decide whether or not two columns of alignment form a base pair. We employed Support Vector Machine (SVM) to predict potential paired sites, and selected covariation information, thermodynamic information and the fraction of complementary bases as feature vectors. Considering the effect of sequence similarity upon covariation score, we introduced a similarity weight factor, which could adjust the contribution of covariation and thermodynamic information toward prediction according to sequence similarity. The test on 49 Rfamseed alignments showed the effectiveness of our method, and the accuracy was better than many similar algorithms. Furthermore, this method could predict simple pseudoknot.
Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Biotechnology
Authors
, ,