Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
407878 | Neurocomputing | 2013 | 13 Pages |
Feature selection has been widely discussed as an important preprocessing step in machine learning and data mining. Evaluation criterion designing arises as a main aspect for constructing feature selection algorithms. In this paper, a new feature evaluation criterion, called the neighborhood effective information ratio (NEIR), is proposed to compute discernibility capability of categorical and numerical features. Based on the evaluation criterion, a general definition of significance of hybrid features is presented. Then a greedy selection algorithm for hybrid feature subsets based on the proposed evaluation criterion is constructed for data classification. We compare the proposed algorithm with other feature selection algorithms. Both theoretical and experimental analysis verifies the effectiveness and the efficiency of the proposed algorithm.
► Novel evaluation criterion NEIR for hybrid feature selection. ► Measuring the decision discernibility in the neighborhood approximation space. ► NEIR based selection algorithm can be applied to hybrid data directly. ► The size of neighborhood is properly set to better fit a specific learning task. ► results verify the effectiveness and the efficiency of our algorithm.