Article ID Journal Published Year Pages File Type
5907767 Genomics 2016 5 Pages PDF
Abstract

•A novel formulation based on Maximum–Minimum Correntropy Criterion (MMCC) is investigated for gene selection.•The optimal number of gene to be selected is explored.•The robustness, speed and stability of MMCC are investigated by broad experimental results on numerous data sets.

One of the central challenges in cancer research is identifying significant genes among thousands of others on a microarray. Since preventing outbreak and progression of cancer is the ultimate goal in bioinformatics and computational biology, detection of genes that are most involved is vital and crucial. In this article, we propose a Maximum–Minimum Correntropy Criterion (MMCC) approach for selection of informative genes from microarray data sets which is stable, fast and robust against diverse noise and outliers and competitively accurate in comparison with other algorithms. Moreover, via an evolutionary optimization process, the optimal number of features for each data set is determined. Through broad experimental evaluation, MMCC is proved to be significantly better compared to other well-known gene selection algorithms for 25 commonly used microarray data sets. Surprisingly, high accuracy in classification by Support Vector Machine (SVM) is achieved by less than 10 genes selected by MMCC in all of the cases.

Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Genetics