کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1179801 962798 2011 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Quantification and statistical significance analysis of group separation in NMR-based metabonomics studies
موضوعات مرتبط
مهندسی و علوم پایه شیمی شیمی آنالیزی یا شیمی تجزیه
پیش نمایش صفحه اول مقاله
Quantification and statistical significance analysis of group separation in NMR-based metabonomics studies
چکیده انگلیسی

Currently, no standard metrics are used to quantify cluster separation in PCA or PLS-DA scores plots for metabonomics studies or to determine if cluster separation is statistically significant. Lack of such measures makes it virtually impossible to compare independent or inter-laboratory studies and can lead to confusion in the metabonomics literature when authors putatively identify metabolites distinguishing classes of samples based on visual and qualitative inspection of scores plots that exhibit marginal separation. While previous papers have addressed quantification of cluster separation in PCA scores plots, none have advocated routine use of a quantitative measure of separation that is supported by a standard and rigorous assessment of whether or not the cluster separation is statistically significant. Here quantification and statistical significance of separation of group centroids in PCA and PLS-DA scores plots are considered. The Mahalanobis distance is used to quantify the distance between group centroids, and the two-sample Hotelling's T2 test is computed for the data, related to an F-statistic, and then an F-test is applied to determine if the cluster separation is statistically significant. We demonstrate the value of this approach using four datasets containing various degrees of separation, ranging from groups that had no apparent visual cluster separation to groups that had no visual cluster overlap. Widespread adoption of such concrete metrics to quantify and evaluate the statistical significance of PCA and PLS-DA cluster separation would help standardize reporting of metabonomics data.


► Use of the Mahalonobis distance to quantify cluster separation in PCA scores plots.
► Use of an F-test to assess if cluster separations are statistically significant.
► PCA and PLS-DA results compared with no scaling and Pareto scaling.
► Use of these techniques will help standardize reporting of metabonomics data.
► Redistribution of significant PCA loadings is demonstrated for Pareto scaling.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Chemometrics and Intelligent Laboratory Systems - Volume 109, Issue 2, 15 December 2011, Pages 162–170
نویسندگان
, ,