Article ID Journal Published Year Pages File Type
4752647 Computational Biology and Chemistry 2017 10 Pages PDF
Abstract

•A two-stage local dimension reduction method for microarray data is proposed.•A new L1-regularized feature selection method is defined in first stage and it can obtain the global optimal solution.•Extensive experiments show the effectiveness of the proposed method.•The mining results can be used for subtype prediction.

Dimension reduction is a crucial technique in machine learning and data mining, which is widely used in areas of medicine, bioinformatics and genetics. In this paper, we propose a two-stage local dimension reduction approach for classification on microarray data. In first stage, a new L1-regularized feature selection method is defined to remove irrelevant and redundant features and to select the important features (biomarkers). In the next stage, PLS-based feature extraction is implemented on the selected features to extract synthesis features that best reflect discriminating characteristics for classification. The suitability of the proposal is demonstrated in an empirical study done with ten widely used microarray datasets, and the results show its effectiveness and competitiveness compared with four state-of-the-art methods. The experimental results on St Jude dataset shows that our method can be effectively applied to microarray data analysis for subtype prediction and the discovery of gene coexpression.

Graphical abstractDownload high-res image (58KB)Download full-size image

Related Topics
Physical Sciences and Engineering Chemical Engineering Bioengineering
Authors
, , , ,