Article ID Journal Published Year Pages File Type
1179909 Chemometrics and Intelligent Laboratory Systems 2011 12 Pages PDF
Abstract

Quantitative structure–property relationship (QSPR) models are widely used for prediction of properties, activities and/or toxicities of new chemicals. Validation strategies check the reliability of predictions of QSPR models. The classical metrics like Q2 and R2pred (Q2ext) are commonly used, besides other techniques, for internal validation (mostly leave-one-out) and external validation (test set validation) respectively. Recently, we have proposed a set of novel rm2 metrics which has been extensively used by us and other research groups for validation of QSPR models. In the present attempt, some additional variants of rm2 metrics have been proposed and their applications in judging the quality of predictions of QSPR models have been shown by analyzing results of the QSPR models obtained from three different data sets (n = 119, 90, and 384). In each case, 50 combinations of training and test sets have been generated, and models have been developed based on the training set compounds and subsequently applied for prediction of responses of the test set compounds. Finally, models for a particular data set have been ranked according to the quality of predictions. The role of different validation metrics (including classical metrics and different variants of rm2 metrics) in differentiating the “good” (predictive) models from the “bad” (low predictive) models has been studied. Finally, a set of guidelines has been proposed for checking the predictive quality of QSPR models.

Related Topics
Physical Sciences and Engineering Chemistry Analytical Chemistry
Authors
, , , ,