کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6870530 681394 2014 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Estimating mutual information for feature selection in the presence of label noise
ترجمه فارسی عنوان
برآورد اطلاعات متقابل برای انتخاب ویژگی در حضور نویز برچسب
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی
A way to achieve feature selection for classification problems polluted by label noise is proposed. The performances of traditional feature selection algorithms often decrease sharply when some samples are wrongly labelled. A method based on a probabilistic label noise model combined with a nearest neighbours-based entropy estimator is introduced to robustly evaluate the mutual information, a popular relevance criterion for feature selection. A backward greedy search procedure is used in combination with this criterion to find relevant sets of features. Experiments establish that (i) there is a real need to take a possible label noise into account when selecting features and (ii) the proposed methodology is effectively able to reduce the negative impact of the mislabelled data points on the feature selection process.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational Statistics & Data Analysis - Volume 71, March 2014, Pages 832-848
نویسندگان
, , ,