کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1179209 962764 2015 18 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Determining the number of components in principal components analysis: A comparison of statistical, crossvalidation and approximated methods
ترجمه فارسی عنوان
تعیین تعداد اجزا در تجزیه و تحلیل اجزای اصلی: مقایسه آماری، اعتبارسنجی و روش های تقریبی
کلمات کلیدی
ماتریس کوواریانس، توزیع تریسی-ویدوم، ارزیابی ابعاد، نظریه ماتریس تصادفی، تجزیه و تحلیل واقعی
موضوعات مرتبط
مهندسی و علوم پایه شیمی شیمی آنالیزی یا شیمی تجزیه
چکیده انگلیسی


• We compare different methods for dimensionality assessment in PCA.
• Cross-validation, approximated and statistical random matrix theory are considered.
• Methods are compared on simulated and real data.
• Differential behavior is observed and commented among methods.
• Guidelines for practitioners are offered.

Principal component analysis is one of the most commonly used multivariate tools to describe and summarize data. Determining the optimal number of components in a principal component model is a fundamental problem in many fields of application. In this paper, we compare the performance of several methods developed for this task in different areas of research. We consider statistical methods based on results from random matrix theory (Tracy–Widom and Kritchman–Nadler testing procedures), cross-validation methods (namely the well-characterized element wise k-fold algorithm, ekf, and its corrected version cekf) and methods based on numerical approximation (SACV and GCV). The performance of these methods is assessed on both simulated and real life data sets. In both cases, differential behavior of the considered methods is observed, for which we propose theoretical explanations.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Chemometrics and Intelligent Laboratory Systems - Volume 149, Part A, 15 December 2015, Pages 99–116
نویسندگان
, ,