کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
406769 678111 2014 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Improve the performance of co-training by committee with refinement of class probability estimations
ترجمه فارسی عنوان
بهبود عملکرد همکاری با کمیته با اصلاح برآوردهای احتمالی کلاس
کلمات کلیدی
همکاری آموزشی، یادگیری نیمه نظارتی، یادگیری گروهی احتمال کلاس، فاصله متریک، ویرایش داده ها
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Semi-supervised learning is a popular machine learning technique where only a small number of labeled examples are available and a large pool of unlabeled examples can be obtained easily. In co-training by committee, a paradigm of semi-supervised learning, it is necessary to pick out a fixed number of most confident examples according to the ranking of class probability values at each iteration. Unfortunately, the class probability values may repeat, which results in the problem that some unlabeled instances share the same probability and will be picked out randomly. This brings a negative effect on the improvement of the performance of classifiers. In this paper, we propose a simple method to deal with this problem under the intuition that different probabilities are crucial. The distance metric between unlabeled instances and labeled instances can be combined with the probabilities of class membership of committee. Two distance metrics are considered to assign each unlabeled example a unique probability value. In order to prove that our method can get higher-quality examples and reduce the introduction of noise, a data editing technique is used to compare with our method. Experimental results verify the effectiveness of our method and the data editing technique, and also confirm that the method for the first distance metric is generally better than the data editing technique.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 136, 20 July 2014, Pages 30–40
نویسندگان
, , , ,