کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6370791 1623879 2013 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Using protein granularity to extract the protein sequence features
ترجمه فارسی عنوان
با استفاده از دانه بندی دانه بندی برای استخراج ویژگی های پیوند پروتئین
کلمات کلیدی
ترکیب آمینو اسید، اثر طولی، پیش بینی کلاس ساختاری، ماشین بردار پشتیبانی، افزایش،
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
چکیده انگلیسی

The feature extraction of protein sequences is a challenging problem. It might need a lot of theoretical and practical knowledge from many fields. The difficulty would increase when investigators extract the features solely from protein sequences. In this paper, we present a method of protein granularity. The concepts of protein granularity, granularity order, granularity bound, granularity limit, and granularity increment are given respectively. The protein granularity can dig out the useful information solely from protein sequences. We provide an approach to construct the feature vectors. The feature vectors include the amino acid composition information, the sequence-order information, the same amino acid 'neighbor' information, and the sequence length information. Hence, the feature vectors can better represent protein sequences. Our feature extraction method does obviously consider the protein sequence length effects. An experiment of the protein structure class prediction was carried out. The prediction achieved 96.6% overall accuracy, and the success rate for each subset is all-α 92.3%, all-β 100%, α/β 100%, α+β 93.5%, respectively. The last three success rates for subsets are equal to the best success rates in the published literatures. The overall accuracy of PG-SVM prediction is the second best result only having one protein prediction error difference with the first best result. The theoretical and experimental results demonstrate the application of protein granularity succeeds in the feature extraction of protein sequences.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Theoretical Biology - Volume 331, 21 August 2013, Pages 48-53
نویسندگان
, , , ,