Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
1146468 | Journal of Multivariate Analysis | 2010 | 18 Pages |
Abstract
In this paper, we propose a new methodology to deal with PCA in high-dimension, low-sample-size (HDLSS) data situations. We give an idea of estimating eigenvalues via singular values of a cross data matrix. We provide consistency properties of the eigenvalue estimation as well as its limiting distribution when the dimension dd and the sample size nn both grow to infinity in such a way that nn is much lower than dd. We apply the new methodology to estimating PC directions and PC scores in HDLSS data situations. We give an application of the findings in this paper to a mixture model to classify a dataset into two clusters. We demonstrate how the new methodology performs by using HDLSS data from a microarray study of prostate cancer.
Keywords
Related Topics
Physical Sciences and Engineering
Mathematics
Numerical Analysis
Authors
Kazuyoshi Yata, Makoto Aoshima,