کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
5907707 1160860 2016 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Non-coding RNA identification based on topology secondary structure and reading frame in organelle genome level
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک
پیش نمایش صفحه اول مقاله
Non-coding RNA identification based on topology secondary structure and reading frame in organelle genome level
چکیده انگلیسی


- The triplets under reading frame in the secondary structure of ncRNA are selected.
- Topology secondary structures of ncRNAs are used as information parameters.
- A method of SVM combining the increment of diversity ID algorithm is presented.

Non-coding RNA (ncRNA) genes make transcripts as same as the encoding genes, and ncRNAs directly function as RNAs rather than serve as blueprints for proteins. As the function of ncRNA is closely related to organelle genomes, it is desirable to explore ncRNA function by confirming its provenance. In this paper, the topology secondary structure, motif and the triplets under three reading frames are considered as parameters of ncRNAs. A method of SVM combining the increment of diversity (ID) algorithm is applied to construct the classifier. When the method is applied to the ncRNA dataset less than 80% sequence identity, the overall accuracies reach 95.57%, 96.40% in the five-fold cross-validation and the jackknife test, respectively. Further, for the independent testing dataset, the average prediction success rate of our method achieved 93.24%. The higher predictive success rates indicate that our method is very helpful for distinguishing ncRNAs from various organelle genomes.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Genomics - Volume 107, Issue 1, January 2016, Pages 9-15
نویسندگان
, , ,