کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4496285 1623867 2014 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Predicting DNA binding proteins using support vector machine with hybrid fractal features
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
پیش نمایش صفحه اول مقاله
Predicting DNA binding proteins using support vector machine with hybrid fractal features
چکیده انگلیسی


• It is original to use CGR and fractal dimension to predict DNA-binding proteins.
• We conducted experiments on seven groups with different features.
• The best results: the average accuracy is 81.82% and average MCC is 0.6017.
• The results are compared with existing method DNA-Prot and shows better performances.

DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not fairly resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and Jackknife test. Comparing the results of numerical experiments, the group of amino acid composition and fractal dimension (21-dimension vector) gets the best result, the average accuracy is 81.82% and average Matthew's correlation coefficient (MCC) is 0.6017. This resulting predictor is also compared with existing method DNA-Prot and shows better performances.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Theoretical Biology - Volume 343, 21 February 2014, Pages 186–192
نویسندگان
, , , ,