Class prediction and gene selection for DNA microarrays using regularized sliced inverse regression

Article ID	Journal	Published Year	Pages	File Type
416621	Computational Statistics & Data Analysis	2007	14 Pages	PDF

Abstract

The monitoring of the expression profiles of thousands of genes have proved to be particularly promising for biological classification. DNA microarray data have been recently used for the development of classification rules, particularly for cancer diagnosis. However, microarray data present major challenges due to the complex, multiclass nature and the overwhelming number of variables characterizing gene expression profiles. A regularized form of sliced inverse regression (REGSIR) approach is proposed. It allows the simultaneous development of classification rules and the selection of those genes that are most important in terms of classification accuracy. The method is illustrated on some publicly available microarray data sets. Furthermore, an extensive comparison with other classification methods is reported. The REGSIR performance is comparable with the best classification methods available, and when appropriate feature selection is made the performance can be considerably improved.

Keywords

SIR Feature selection Microarray Classification Regularization Dimension reduction