کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
1168473 | 1491159 | 2009 | 7 صفحه PDF | دانلود رایگان |
عنوان انگلیسی مقاله ISI
Predicting liquid chromatographic retention times of peptides from the Drosophila melanogaster proteome by machine learning approaches
دانلود مقاله + سفارش ترجمه
دانلود مقاله ISI انگلیسی
رایگان برای ایرانیان
کلمات کلیدی
Random forest - جنگلهای تصادفی یا جنگلهای تصمیم تصادفیQuantitative structure–retention relationship - رابطه ساختاری حفظ ساختار کمیGaussian process - فرآیند گاوسیPeptide - پپتید liquid chromatography - کروماتوگرافی مایعleast-squares support vector machine - کمترین مربعات دستگاه بردار پشتیبانی می کند
موضوعات مرتبط
مهندسی و علوم پایه
شیمی
شیمی آنالیزی یا شیمی تجزیه
پیش نمایش صفحه اول مقاله

چکیده انگلیسی
Three machine learning algorithms as least-squares support vector machine (LSSVM), random forest (RF) and Gaussian process (GP) were used to model the quantitative structure-retention relationship (QSRR) for predicting and explaining the retention behavior of proteome-wide peptides in the reverse-phase liquid chromatography. Peptides were parameterized using CODESSA approach and 145 descriptors were obtained for each peptide, including diverse structural information such as constitutional, topological, geometrical and physicochemical property. Based upon that, the nonlinear LSSVM, RF and GP as well as another sophisticated linear method (partial least-squares regression (PLS)) were employed in the QSRR model development. By a series of systematic validations as internal cross-validation, external test and Monte Carlo cross-validation, the stability and predictive power of the constructed models were confirmed. Results show that regression models developed using nonlinear approaches such as LSSVM, RF and GP predict better than linear PLS models. Considering the retention times used in this work were measured in different columns and thus have a relatively large uncertainty (reproducibility within 7%), the optimal statistics obtained from GP modeling are satisfactory, with the coefficients of determination (R2) for training set and test set of 0.894 and 0.866, respectively.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Analytica Chimica Acta - Volume 644, Issues 1â2, 30 June 2009, Pages 10-16
Journal: Analytica Chimica Acta - Volume 644, Issues 1â2, 30 June 2009, Pages 10-16
نویسندگان
Feifei Tian, Li Yang, Fenglin Lv, Peng Zhou,