کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
530614 869779 2013 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Predicting noise filtering efficacy with data complexity measures for nearest neighbor classification
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Predicting noise filtering efficacy with data complexity measures for nearest neighbor classification
چکیده انگلیسی

Classifier performance, particularly of instance-based learners such as k-nearest neighbors, is affected by the presence of noisy data. Noise filters are traditionally employed to remove these corrupted data and improve the classification performance. However, their efficacy depends on the properties of the data, which can be analyzed by what are known as data complexity measures. This paper studies the relation between the complexity metrics of a dataset and the efficacy of several noise filters to improve the performance of the nearest neighbor classifier. A methodology is proposed to extract a rule set based on data complexity measures that enables one to predict in advance whether the use of noise filters will be statistically profitable. The results obtained show that noise filtering efficacy is to a great extent dependent on the characteristics of the data analyzed by the measures. The validation process carried out shows that the final rule set provided is fairly accurate in predicting the efficacy of noise filters before their application and it produces an improvement with respect to the indiscriminate usage of noise filters.


► Noise filtering efficacy depends on the characteristics of the data.
► Data complexity measures can be used to predict when filtering will be profitable.
► A methodology to extract a rule set which performs that prediction is provided.
► The conditions under which a noise filter works well are similar for others.
► The overlapping and dispersion of the examples influence filtering efficacy.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 46, Issue 1, January 2013, Pages 355–364
نویسندگان
, , ,