کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1181289 1491544 2014 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An improved QSPR method based on support vector machine applying rational sample data selection and genetic algorithm-controlled training parameters optimization
موضوعات مرتبط
مهندسی و علوم پایه شیمی شیمی آنالیزی یا شیمی تجزیه
پیش نمایش صفحه اول مقاله
An improved QSPR method based on support vector machine applying rational sample data selection and genetic algorithm-controlled training parameters optimization
چکیده انگلیسی


• A novel QSPR method was proposed to improve the predictive abilities of SVM models.
• The effect of the input variables on the SVM models was investigated in theory.
• Optimal parameters and input representations of SVM models were optimized globally.

An improved QSPR method based on support vector machine (SVM) applying rational sample data selection and genetic algorithm (GA)-controlled training parameters optimization was developed to study the standard formation Gibbs free energy of 78 kinds of acyclic alkanes. The SVMs were trained applying the standard regression algorithm based on quadratic programming theory, and the Gaussian radial basis kernel function (RBKF) was employed in the training process. Meanwhile, eight well-known topological indices were used as structural descriptors for each alkane molecule, and they were also considered to be the potential input variables for the proposed QSPR models. Subsequently, by optimizing the ε parameter in insensitive loss function, the penal factor C, the σ parameter in RBKF and the input variable representations simultaneously via GA, a novel QSPR approach based on the combination of GA and SVM was proposed to improve the prediction results of the independent external test samples. For independent external test samples selected randomly prior to QSPR model development, an improved predictive modeling method based on SVM was achieved by rationally selecting the training and the internal test data set with sphere exclusion algorithm and optimizing the SVM training parameters by the proposed GA method.For comparing purpose, partial least square (PLS) regression method was also used as another QSPR modeling tool for the experimental data set. Moreover, to verify the improved modeling method in a more general way, two mathematically simulated QSPR data sets were built to confirm its validity.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Chemometrics and Intelligent Laboratory Systems - Volume 134, 15 May 2014, Pages 34–46
نویسندگان
,