Article code | Journal code | Year | English paper | Full-text version |
---|---|---|---|---|
533342 | 870105 | 2013 | 6-page PDF | Free download |

Supervised feature selection is an important problem in pattern recognition. Of the many methods introduced, those based on the mutual information and conditional mutual information measures are among the most widely adopted. In this paper, we re-analyze an interesting paper on this topic recently published by Sotoca and Pla (Pattern Recognition, Vol. 43, Issue 6, June 2010, pp. 2068–2081). In that work, a method for supervised feature selection is proposed that clusters the features into groups using a distance measure based on conditional mutual information. The clustering procedure minimizes an objective function called the minimal relevant redundancy (mRR) criterion. It is proposed that this objective function is an upper bound on the information loss incurred when the full set of features is replaced by a smaller subset. We have found that their proof of this proposition rests on certain erroneous assumptions, and that the proposition itself is not true in general. To remedy the reported work, we characterize the specific conditions under which the assumptions used in the proof, and hence the proposition, hold true. We find that there is a reasonable condition, namely that all features are independent given the class variable (as assumed by the popular naive Bayes classifier), under which the assumptions required by Sotoca and Pla's framework hold true.
► We discuss a paper by Sotoca and Pla on feature selection using mutual information.
► We show that two of the propositions in that paper are erroneous.
► We discuss the impact of these findings.
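As an illustrative sketch (not the authors' code), the class-conditional independence condition discussed above can be checked empirically: when two features are independent given the class variable, their empirical conditional mutual information I(X;Y|C) is close to zero even though their marginal mutual information I(X;Y) may be large. The helper `cmi` below is a hypothetical plug-in estimator over discrete samples; the synthetic data (noisy copies of a binary class label) is an assumed example, not from the paper.

```python
import math
import random
from collections import Counter

def cmi(xs, ys, cs):
    """Plug-in estimate of conditional mutual information I(X;Y|C) in bits.

    With a constant cs, this reduces to the marginal mutual information I(X;Y).
    """
    n = len(xs)
    n_xyc = Counter(zip(xs, ys, cs))   # joint counts of (x, y, c)
    n_xc = Counter(zip(xs, cs))
    n_yc = Counter(zip(ys, cs))
    n_c = Counter(cs)
    total = 0.0
    for (x, y, c), count in n_xyc.items():
        # p(x,y,c) * log2( p(x,y,c) p(c) / (p(x,c) p(y,c)) ),
        # with the sample size n cancelling inside the log.
        total += (count / n) * math.log2(
            (count * n_c[c]) / (n_xc[(x, c)] * n_yc[(y, c)]))
    return total

random.seed(0)
# Two binary features that are independent given the class C (the naive
# Bayes assumption), but marginally dependent because each is a noisy
# copy of C (correct with probability 0.9).
cs = [random.randint(0, 1) for _ in range(50000)]
xs = [c if random.random() < 0.9 else 1 - c for c in cs]
ys = [c if random.random() < 0.9 else 1 - c for c in cs]

print(cmi(xs, ys, cs))            # close to 0: independent given C
print(cmi(xs, ys, [0] * len(cs))) # marginal I(X;Y): clearly positive
```

The conditional estimate is near zero (up to the small positive bias of the plug-in estimator), while the marginal mutual information is substantial, which is exactly the regime in which the naive-Bayes-style condition identified in the paper applies.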
Journal: Pattern Recognition - Volume 46, Issue 4, April 2013, Pages 1220–1225