کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
406196 678069 2015 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Correction of noisy labels via mutual consistency check
ترجمه فارسی عنوان
اصلاح برچسب های پر سر و صدا از طریق بررسی صحت متقابل
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• Label noise can have a severe negative effect on the performance of a classifier.
• We propose a new pre-processing method to correct noisy labels via mutual consistency check using a Parzen window classifier.
• We have used the Spannogram framework to solve the problem with approximation guarantees.
• Experiment has been done on a large number of data-sets with different types of noise.
• We show that the proposed method is able to detect noisy labels with high precision and recall.

Label noise can have severe negative effects on the performance of a classifier. Such noise can either arise by adversarial manipulation of the training data or from unskilled annotators frequently encountered in crowd sourcing (e.g. Amazon mechanical turk). Based on the assumption that an expert has provided some fraction of the training data, where labels can be assumed to be true, we propose a new pre-processing method to identify and correct noisy labels via a mutual consistency check using a Parzen window classifier. While the resulting optimization problem turns out to be a combinatorial problem, we design an efficient algorithm for which we provide approximation guarantees. Extensive experimental evaluation shows that our method performs similar and often much better than existing methods for the detection of noisy labels, thus leading to a boost in performance of the resulting classifiers.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 160, 21 July 2015, Pages 34–52
نویسندگان
, ,