Article code: 35167
Journal code: 45079
Publication year: 2009
English article: 7 pages (PDF)
Full-text version: free download
English title of the ISI article
Discriminating acidic and alkaline enzymes using a random forest model with secondary structure amino acid composition
Related topics
Engineering and Basic Sciences, Chemical Engineering, Bioengineering
English abstract

Understanding how enzymes adapt to pH extremes, and discriminating acidic from alkaline enzymes, is a challenging task that would help in designing stable enzymes. In this work, we systematically analyzed the secondary structure amino acid compositions of 105 acidic and 111 alkaline enzymes. We found that the propensity of individual residues to participate in different secondary structures might be a general stability mechanism for adaptation to pH extremes. Based on this finding, we present a secondary structure amino acid composition method for extracting useful features from sequence, and a novel ensemble classifier, random forest, was used for prediction. The overall prediction accuracy, evaluated by 10-fold cross-validation, reached 90.7%. Compared with other feature extraction methods, our method improved the overall prediction accuracy by 5.5% to 21.2%. The random forest algorithm also outperformed other machine learning techniques, with improvements ranging from 3.2% to 19.9%.
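
The feature extraction idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a three-state secondary structure labelling (H for helix, E for strand, C for coil) is already available for each residue, and it assumes each feature is the count of one amino acid in one secondary structure state divided by the sequence length. The function name `ss_aa_composition` and the toy sequence are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
SS_STATES = "HEC"  # assumed labels: helix, strand, coil


def ss_aa_composition(sequence, secondary_structure):
    """Return a 60-value vector: one fraction per (state, amino acid) pair.

    Assumption: each fraction is normalized by the full sequence length.
    The paper may define the normalization differently.
    """
    if len(sequence) != len(secondary_structure):
        raise ValueError("sequence and secondary structure must have equal length")
    if not sequence:
        raise ValueError("sequence must not be empty")
    # Count how often each residue type occurs in each structural state.
    counts = Counter(zip(secondary_structure, sequence))
    total = len(sequence)
    return [counts[(ss, aa)] / total for ss in SS_STATES for aa in AMINO_ACIDS]


# Toy input, not real enzyme data.
features = ss_aa_composition("MKVLAAGE", "CHHHHEEC")
```

Vectors of this kind, one per enzyme, could then be passed to any random forest implementation and scored with 10-fold cross-validation, as the abstract describes.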

Publisher
Database: Elsevier - ScienceDirect
Journal: Process Biochemistry - Volume 44, Issue 6, June 2009, Pages 654–660