Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4969604 | Pattern Recognition | 2017 | 28 Pages |
Abstract
Most of the current unsupervised feature selection methods are designed to process only numerical datasets. Therefore, in practical problems, where the objects under study are described through both numerical and non-numerical features (mixed datasets), these methods cannot be directly applied. In this work, we propose a new unsupervised filter feature selection method that can be used on datasets with both numerical and non-numerical features. The proposed method is inspired by the spectral feature selection, by using together a kernel and a new spectrum based feature evaluation measure for quantifying the feature relevance. Experiments on synthetic datasets show that in the 99% of the cases where the relevant features are known our method identifies and ranks the most relevant features at the beginning of a sorted list. Additionally, we contrast our method against state-of-the-art unsupervised filter methods over real datasets, and our method in most cases significantly outperforms them.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Vision and Pattern Recognition
Authors
Saúl Solorio-Fernández, José Fco. MartÃnez-Trinidad, J. Ariel Carrasco-Ochoa,