کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1168473 1491159 2009 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Predicting liquid chromatographic retention times of peptides from the Drosophila melanogaster proteome by machine learning approaches
موضوعات مرتبط
مهندسی و علوم پایه شیمی شیمی آنالیزی یا شیمی تجزیه
پیش نمایش صفحه اول مقاله
Predicting liquid chromatographic retention times of peptides from the Drosophila melanogaster proteome by machine learning approaches
چکیده انگلیسی
Three machine learning algorithms as least-squares support vector machine (LSSVM), random forest (RF) and Gaussian process (GP) were used to model the quantitative structure-retention relationship (QSRR) for predicting and explaining the retention behavior of proteome-wide peptides in the reverse-phase liquid chromatography. Peptides were parameterized using CODESSA approach and 145 descriptors were obtained for each peptide, including diverse structural information such as constitutional, topological, geometrical and physicochemical property. Based upon that, the nonlinear LSSVM, RF and GP as well as another sophisticated linear method (partial least-squares regression (PLS)) were employed in the QSRR model development. By a series of systematic validations as internal cross-validation, external test and Monte Carlo cross-validation, the stability and predictive power of the constructed models were confirmed. Results show that regression models developed using nonlinear approaches such as LSSVM, RF and GP predict better than linear PLS models. Considering the retention times used in this work were measured in different columns and thus have a relatively large uncertainty (reproducibility within 7%), the optimal statistics obtained from GP modeling are satisfactory, with the coefficients of determination (R2) for training set and test set of 0.894 and 0.866, respectively.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Analytica Chimica Acta - Volume 644, Issues 1–2, 30 June 2009, Pages 10-16
نویسندگان
, , , ,