کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
403644 677297 2013 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Inter-training: Exploiting unlabeled data in multi-classifier systems
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Inter-training: Exploiting unlabeled data in multi-classifier systems
چکیده انگلیسی

We present a new and more general co-training style framework named Inter-training, to exploit unlabeled data in multi-classifier systems, and develop two concrete algorithms which employ some new strategies to iteratively retrain base classifiers. The decrease of diversity during iterations is a main problem which hinders the further improvement of co-training style algorithms. In this paper, we propose a method to recreate diversity among base classifiers by manipulating the pseudo-labeled data for co-training style algorithms. Furthermore, in the theoretical aspect, we define a hybrid classification and distribution (HCAD) noise and provide a Probably Approximately Correct (PAC) analysis for co-training style algorithms in the presence of HCAD noise. Experimental results on six datasets show that our method performs much better in practice, and the superiority is especially obvious on hardly-classified datasets.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 45, June 2013, Pages 8–19
نویسندگان
, , ,