کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
406195 678069 2015 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Small margin ensembles can be robust to class-label noise
ترجمه فارسی عنوان
گروه های حاشیه ای کوچک می توانند سر و صدا با برچسب های کلاس قوی باشند
کلمات کلیدی
سر و صدا برچسب، بسته بندی طبقه بندی حاشیه کوچک، نمونه برداری بوت استرپ
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Subsampling is used to generate bagging ensembles that are accurate and robust to class-label noise. The effect of using smaller bootstrap samples to train the base learners is to make the ensemble more diverse. As a result, the classification margins tend to decrease. In spite of having small margins, these ensembles can be robust to class-label noise. The validity of these observations is illustrated in a wide range of synthetic and real-world classification tasks. In the problems investigated, subsampling significantly outperforms standard bagging for different amounts of class-label noise. By contrast, the effectiveness of subsampling in random forest is problem dependent. In these types of ensembles the best overall accuracy is obtained when the random trees are built on bootstrap samples of the same size as the original training data. Nevertheless, subsampling becomes more effective as the amount of class-label noise increases.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 160, 21 July 2015, Pages 18–33
نویسندگان
, , ,