Article ID Journal Published Year Pages File Type
6855036 Expert Systems with Applications 2018 22 Pages PDF
Abstract
Feature selection plays an important role in the classification problems associated with expert and intelligent systems. The central idea behind feature selection is to identify important input features in order to reduce the dimensionality of the input space while maintaining or improving classification performance. Traditional feature selection approaches were designed to handle either categorical or numerical features, but not the mix of both that often arises in real datasets. In this paper, we propose a novel feature selection algorithm for classifying mixed-type data, based on a rough set model, called feature selection for mixed-type data with feature space decomposition (FSMSD). This can handle both categorical and numerical features by utilizing rough set theory with a heterogeneous Euclidean-overlap metric, and can be applied to mixed-type data. It also uses feature space decomposition to preserve the properties of multi-valued categorical features, thereby reducing information loss and preserving the features' physical meaning. The proposed algorithm was compared with four benchmark methods using real mixed-type datasets and biomedical datasets, and its performance was promising, indicating that it will be helpful to users of expert and intelligent systems.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, ,