Feature selection in machine learning: A new perspective

Article ID	Journal	Published Year	Pages	File Type
6863885	Neurocomputing	2018	13 Pages	PDF

Abstract

High-dimensional data analysis is a challenge for researchers and engineers in machine learning and data mining. Feature selection provides an effective way to address this problem by removing irrelevant and redundant data, which can reduce computation time, improve learning accuracy, and facilitate a better understanding of the learning model or the data. In this study, we discuss several frequently used evaluation measures for feature selection, and then survey supervised, unsupervised, and semi-supervised feature selection methods, which are widely applied to machine learning problems such as classification and clustering. Lastly, future challenges in feature selection are discussed.
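
As an illustration of what removing irrelevant and redundant features can look like in the supervised case, the following is a minimal sketch of a filter-style selector. It is not a method taken from the article: the use of Pearson correlation as the evaluation measure, the function names `pearson` and `select_features`, the two thresholds, and the toy data are all assumptions made for this example.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def select_features(columns, target, min_relevance=0.3, max_redundancy=0.9):
    """Return indices of kept features, most relevant first.

    Hypothetical thresholds: a feature is treated as irrelevant if its absolute
    correlation with the target is below min_relevance, and as redundant if its
    absolute correlation with an already kept feature exceeds max_redundancy.
    """
    relevance = [abs(pearson(col, target)) for col in columns]
    order = sorted(range(len(columns)), key=lambda j: relevance[j], reverse=True)
    kept = []
    for j in order:
        if relevance[j] < min_relevance:
            continue  # irrelevant: weak relation to the target
        if any(abs(pearson(columns[j], columns[k])) > max_redundancy for k in kept):
            continue  # redundant: nearly duplicates a kept feature
        kept.append(j)
    return kept


# Toy data (made up for this sketch): six samples, binary class label.
target = [0, 0, 0, 1, 1, 1]
columns = [
    [1.0, 2.0, 1.5, 5.0, 6.0, 5.5],     # feature 0: tracks the label
    [2.0, 4.0, 3.0, 10.0, 12.0, 11.0],  # feature 1: rescaled copy of feature 0
    [3.0, 1.0, 2.0, 2.0, 1.0, 3.0],     # feature 2: unrelated to the label
    [7.0, 7.0, 7.0, 7.0, 7.0, 7.0],     # feature 3: constant
]
selected = select_features(columns, target)
print(selected)
```

On this toy data, only one of the two strongly correlated features survives, and the unrelated and constant features are dropped. Correlation only captures linear relationships, so this sketch stands in for the broader family of evaluation measures rather than representing any one of them faithfully.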

Keywords

Feature selection; Data mining; Dimensionality reduction; Machine learning