Article ID Journal Published Year Pages File Type
494910 Applied Soft Computing 2016 11 Pages PDF
Abstract

•We propose two new, simple, and efficient Hybrid Feature Selection techniques.•We use a feature-based ranking to initialize the Binary Differential Evolution.•We also propose a new fitness function influenced by the features in the population.•Several statistical tests show the robustness and effectiveness of the proposals.•The reducing of the size of the original set of features is larger than 99%.

Microarray experiments generally deal with complex and high-dimensional samples, and in addition, the number of samples is much smaller than their dimensions. Both issues can be alleviated by using a feature selection (FS) method. In this paper two new, simple, and efficient hybrid FS algorithms, called respectively BDE-XRankXRank and BDE-XRankfXRankf, are presented. Both algorithms combine a wrapper FS method based on a Binary Differential Evolution (BDE) algorithm with a rank-based filter FS method. Besides, they generate the initial population with solutions involving only a small number of features. Some initial solutions are built considering only the most relevant features regarding the filter method, and the remaining ones include only random features (to promote diversity). In the BDE-XRankfXRankf, a new fitness function, in which the score value of a solution is influenced by the frequency of the features in the current population, is incorporated in the algorithm. The robustness of BDE-XRankXRank and BDE-XRankfXRankf is shown by using four Machine Learning (ML) algorithms (NB, SVM, C4.5, and kNN  ). Six high-dimensional well-known data sets of microarray experiments are used to carry out an extensive experimental study based on statistical tests. This experimental analysis shows the robustness as well as the ability of both proposals to obtain highly accurate solutions at the earlier stages of BDE evolutionary process. Finally, BDE-XRankXRank and BDE-XRankfXRankf are also compared against the results of nine state-of-the-art algorithms to highlight its competitiveness and the ability to successfully reduce the original feature set size by more than 99%.

Graphical abstractFigure optionsDownload full-size imageDownload as PowerPoint slide

Keywords
Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
, , ,