Article ID Journal Published Year Pages File Type
383892 Expert Systems with Applications 2013 8 Pages PDF
Abstract

We consider a feature selection problem where the decision-making objective is to minimize overall misclassification cost by selecting relevant features from a training dataset. We propose a two-stage solution approach for solving misclassification cost minimizing feature selection (MCMFS) problem. Additionally, we propose a maximum-margin genetic algorithm (MMGA) that maximizes margin of separation between classes by taking into account all examples as opposed to maximizing margin of separation using a few support vectors. Feature selection is carried out by either an exhaustive or a heuristic simulated annealing approach in the first stage and a cost sensitive classification using either MMGA or cost sensitive support vector machines (SVM) in the second stage. Using simulated and real-world data sets and different misclassification cost matrices, we test our two-stage approach for solving the MCMFS problem. Our results indicate that feature selection plays an important role when misclassification cost asymmetries increase and the MMGA shows equal or better performance than the SVM.

► Developed a hybrid maximum margin maximizing genetic algorithm (MMGA) approach for feature selection. ► Developed a feature selection based cost sensitive support vector machine (SVM). ► Compared feature selection based MMGA and SVM approaches using simulated and real-world datasets.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
,