Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures

Article ID	Journal	Published Year	Pages	File Type
6869362	Computational Statistics & Data Analysis	2016	18 Pages	PDF

Abstract

A method for dimension reduction with clustering, classification, or discriminant analysis is introduced. The approach fits generalized hyperbolic mixtures on a reduced subspace within the paradigm of model-based clustering, classification, or discriminant analysis. A reduced subspace of the data is derived by considering the extent to which group means and group covariances vary. The members of the subspace arise through linear combinations of the original data, and are ordered by importance via the associated eigenvalues. The observations can be projected onto the subspace, resulting in a set of variables that captures most of the clustering information available. The use of generalized hyperbolic mixtures gives a robust framework capable of dealing with skewed clusters. Dimension reduction is increasingly in demand across various application areas; because many of these applications are biological, some of the real data examples are drawn from that sphere. Simulated data are also used for illustration.
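
The subspace construction described in the abstract (directions along which group means and group covariances vary, ordered by eigenvalue) can be illustrated with a minimal sketch. The abstract does not state the exact kernel matrix, so the one used here is an assumption: a between-means term plus a between-covariances term, each scaled by the inverse marginal covariance, followed by a generalized eigendecomposition. The function name `reduced_subspace` is hypothetical, and hard group labels stand in for the memberships that a fitted generalized hyperbolic mixture would supply; the mixture fitting itself is not shown.

```python
import numpy as np

def reduced_subspace(X, labels):
    """Return (eigenvalues, basis), ordered by decreasing eigenvalue.

    Assumed kernel (not taken from the abstract):
        M = M_means S^-1 M_means + M_covs S^-1 M_covs,
    where S is the marginal covariance. The basis vectors solve
    the generalized eigenproblem M v = lambda S v.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n, p = X.shape
    mu = X.mean(axis=0)
    S = np.cov(X, rowvar=False, bias=True)  # marginal covariance

    props, means, covs = [], [], []
    for g in np.unique(labels):
        Xg = X[labels == g]
        props.append(len(Xg) / n)                         # group proportion
        means.append(Xg.mean(axis=0))                     # group mean
        covs.append(np.cov(Xg, rowvar=False, bias=True))  # group covariance
    pooled = sum(w * C for w, C in zip(props, covs))      # pooled within-group covariance

    S_inv = np.linalg.inv(S)
    # Variation among group means.
    M_means = sum(w * np.outer(m - mu, m - mu) for w, m in zip(props, means))
    # Variation among group covariances.
    M_covs = sum(w * (C - pooled) @ S_inv @ (C - pooled) for w, C in zip(props, covs))
    M = M_means @ S_inv @ M_means + M_covs @ S_inv @ M_covs

    # Solve M v = lambda S v by whitening with the Cholesky factor S = L L^T.
    L_inv = np.linalg.inv(np.linalg.cholesky(S))
    A = L_inv @ M @ L_inv.T
    evals, W = np.linalg.eigh((A + A.T) / 2)  # symmetrize against round-off
    order = np.argsort(evals)[::-1]           # largest eigenvalue first
    return evals[order], L_inv.T @ W[:, order]

# Simulated example: two groups in five dimensions that differ
# only in the mean of the first variable.
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 5))
labels = np.repeat([0, 1], 200)
X[labels == 0, 0] -= 2.0
X[labels == 1, 0] += 2.0

evals, basis = reduced_subspace(X, labels)
Z = (X - X.mean(axis=0)) @ basis[:, :2]  # project onto the two leading directions
```

In this simulated setting the leading direction loads mainly on the first variable, which is the only one carrying group information; the columns of `Z` are the reduced variables on which a mixture model would then be fitted.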

Keywords

Model-based classification; Generalized hyperbolic distribution; Model-based clustering; Mixture models; Dimension reduction