کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
382833 660794 2015 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Sequential random k-nearest neighbor feature selection for high-dimensional data
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Sequential random k-nearest neighbor feature selection for high-dimensional data
چکیده انگلیسی


• The study proposes the ensemble-based feature selection algorithm.
• The proposed algorithm is especially useful for large p and small n problems.
• Experiments with 20 real data demonstrated the effectiveness of the proposed algorithm.

Feature selection based on an ensemble classifier has been recognized as a crucial technique for modeling high-dimensional data. Feature selection based on the random forests model, which is constructed by aggregating multiple decision tree classifiers, has been widely used. However, a lack of stability and balance in decision trees decreases the robustness of random forests. This limitation motivated us to propose a feature selection method based on newly designed nearest-neighbor ensemble classifiers. The proposed method finds significant features by using an iterative procedure. We performed experiments with 20 datasets of microarray gene expressions to examine the property of the proposed method and compared it with random forests. The results demonstrated the effectiveness and robustness of the proposed method, especially when the number of features exceeds the number of observations.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 42, Issue 5, 1 April 2015, Pages 2336–2342
نویسندگان
, ,