Article code: 2079012
Journal code: 1545053
Publication year: 2008
English article: 7 pages, PDF
Full-text version: free download
English title of the ISI article
Prediction of Outer Membrane Proteins Using Support Vector Machine with Combined Features
Related subjects
Life Sciences and Biotechnology; Biochemistry, Genetics and Molecular Biology; Biotechnology
English abstract
Discriminating outer membrane proteins (OMPs) from globular and membrane proteins of other folding types is an important task, both for identifying OMPs in genomic sequences and for successfully predicting their secondary and tertiary structures. This study describes a discriminative method that encodes protein sequences as combined feature vectors and classifies them with machine learning techniques. The new combined feature vector consists of three types of features: amino acid composition, dipeptide composition, and weighted AAindex correlation coefficients. A classification system named CF_SVM is then built on this combined feature coding method and support vector machine algorithms. In cross-validation tests and independent dataset tests on a dataset of 1087 proteins covering all the different types of globular and membrane proteins, CF_SVM outperforms other methods in the literature at discriminating OMPs from other proteins. The influence of different ranks and weights of the AAindex correlation coefficients in the combined features on discrimination accuracy is also discussed. The results of OMP mining in 14 bacterial genomes show that CF_SVM can predict OMP candidates with high specificity and performs better than TMBETA-GENOME, another OMP mining tool.
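To make the combined feature vector concrete, the following is a minimal Python sketch of how the three feature types named in the abstract could be concatenated. It is not the authors' implementation: the abstract does not define the AAindex correlation coefficient, so `index_correlation` assumes a simple form (the mean product of property values for residues `k` positions apart), and the function names, the `max_rank` and `weight` parameters, and the toy property scale in the usage note are all hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def amino_acid_composition(seq):
    """Fraction of each of the 20 standard residues (20 values)."""
    residues = [r for r in seq.upper() if r in AMINO_ACIDS]
    n = len(residues)
    return [residues.count(a) / n if n else 0.0 for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Fraction of each ordered residue pair (400 values).

    Non-standard residues are dropped first, so the residues on either
    side of a dropped one are treated as adjacent.
    """
    clean = "".join(r for r in seq.upper() if r in AMINO_ACIDS)
    counts = dict.fromkeys(DIPEPTIDES, 0)
    for i in range(len(clean) - 1):
        counts[clean[i:i + 2]] += 1
    total = max(len(clean) - 1, 1)
    return [counts[d] / total for d in DIPEPTIDES]


def index_correlation(seq, index, max_rank):
    """ASSUMED form of the correlation feature, one value per rank k.

    `index` maps residue letters to a numeric property (for example one
    AAindex scale); residues missing from it are skipped.
    """
    vals = [index[r] for r in seq.upper() if r in index]
    out = []
    for k in range(1, max_rank + 1):
        pairs = len(vals) - k
        if pairs > 0:
            out.append(sum(vals[i] * vals[i + k] for i in range(pairs)) / pairs)
        else:
            out.append(0.0)
    return out


def combined_features(seq, index, max_rank=3, weight=1.0):
    """Concatenate the three feature groups into a single vector."""
    corr = [weight * c for c in index_correlation(seq, index, max_rank)]
    return amino_acid_composition(seq) + dipeptide_composition(seq) + corr
```

With a toy two-residue scale such as `{"A": 1.0, "C": -1.0}`, calling `combined_features("ACAC", scale, max_rank=2)` yields a vector of 20 + 400 + 2 values. Vectors of this kind could then be passed to any support vector machine implementation for training; the kernel and parameters used in CF_SVM are not stated in the abstract.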
Publisher
Database: Elsevier - ScienceDirect
Journal: Chinese Journal of Biotechnology - Volume 24, Issue 4, April 2008, Pages 651-658