Article ID Journal Published Year Pages File Type
536489 Pattern Recognition Letters 2012 8 Pages PDF
Abstract

We consider the problem of using a large amount of unlabeled data to improve the efficiency of feature selection in high-dimension when only a small amount of labeled examples is available. We propose a new method called semi-supervised ensemble learning guided feature ranking method (SEFR for short), that combines a bagged ensemble of standard semi-supervised approaches with a permutation-based out-of-bag feature importance measure that takes into account both labeled and unlabeled data. We provide empirical results on several benchmark data sets indicating that SEFR can lead to significant improvement over state-of-the-art supervised and semi-supervised algorithms.

► We propose a new method called semi-supervised ensemble learning guided feature ranking method. ► It combines a bagged ensemble of semi-supervised approaches with a permutation-based out-of-bag feature importance measure. ► Both labeled and unlabeled data are taken into account. ► The method lead to significant improvement over state-of-the-art supervised and semi-supervised algorithms.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , ,