Article ID: 415841
Journal: Computational Statistics & Data Analysis
Published Year: 2012
Pages: 8
File Type: PDF
Abstract

We propose a simple method to assess the number of subpopulations in multivariate data by projecting the data on its principal curve and then applying Silverman’s bandwidth test to the resulting univariate sample. Our results indicate that this method works well even in high-dimensional settings with relatively small sample sizes, provided that the number of subpopulations is not large compared to the number of dimensions.
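The two-step procedure described in the abstract can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the projection uses the first principal component as a linear stand-in for the principal curve (fitting a true principal curve is not shown here), and the function names `project_first_pc`, `count_modes`, `critical_bandwidth`, and `silverman_test`, as well as the grid size, iteration count, and bootstrap size, are hypothetical choices made for illustration.

```python
import numpy as np


def project_first_pc(X):
    """Project rows of X onto the first principal component.

    ASSUMPTION: a linear stand-in for the principal-curve projection
    named in the abstract; it is not the same technique.
    """
    Xc = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ vt[0]


def count_modes(x, h, grid_size=512):
    """Count local maxima of a Gaussian kernel density estimate with bandwidth h."""
    grid = np.linspace(x.min() - 3 * h, x.max() + 3 * h, grid_size)
    # Unnormalised KDE: constant factors do not change the number of modes.
    dens = np.exp(-0.5 * ((grid[:, None] - x[None, :]) / h) ** 2).sum(axis=1)
    d = np.diff(dens)
    # A mode is a grid point where the density stops rising.
    return int(np.sum((d[:-1] > 0) & (d[1:] <= 0)))


def critical_bandwidth(x, k, n_iter=40):
    """Smallest bandwidth (up to bisection tolerance) giving at most k modes.

    Relies on the mode count of a Gaussian KDE being non-increasing in h.
    """
    lo, hi = 0.0, float(x.max() - x.min())
    for _ in range(n_iter):
        mid = 0.5 * (lo + hi)
        if count_modes(x, mid) <= k:
            hi = mid
        else:
            lo = mid
    return hi


def silverman_test(x, k=1, n_boot=200, seed=0):
    """Silverman's bandwidth test of H0: the density has at most k modes.

    Returns the critical bandwidth and a smoothed-bootstrap p-value.
    """
    rng = np.random.default_rng(seed)
    h = critical_bandwidth(x, k)
    # Rescaling keeps the variance of the smoothed resamples equal to that of x.
    scale = np.sqrt(1.0 + h ** 2 / x.var())
    exceed = 0
    for _ in range(n_boot):
        xb = rng.choice(x, size=x.size, replace=True)
        yb = x.mean() + (xb - x.mean() + h * rng.standard_normal(x.size)) / scale
        exceed += count_modes(yb, h) > k
    return h, exceed / n_boot
```

To use the sketch, pass an `(n, d)` data array to `project_first_pc` and call `silverman_test` on the result for k = 1, 2, ... in turn; the smallest k whose p-value is not small would be taken as the estimated number of subpopulations.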

Related Topics
Physical Sciences and Engineering > Computer Science > Computational Theory and Mathematics