کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
533118 870061 2016 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Robust multiple-instance learning ensembles using random subspace instance selection
ترجمه فارسی عنوان
گروههای یادگیری چند نمونه قوی با استفاده از نمونه تصادفی زیرمجموعه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• A new method, Random Subspace Instance Selection, is proposed to design MIL ensembles.
• The method yields ensembles that are robust to variations of witness rate, data distributions and noise.
• The method yields state-of-the-art results on several benchmark data sets.

Many real-world pattern recognition problems can be modeled using multiple-instance learning (MIL), where instances are grouped into bags, and each bag is assigned a label. State-of-the-art MIL methods provide a high level of performance when strong assumptions are made regarding the underlying data distributions, and the proportion of positive to negative instances in positive bags. In this paper, a new method called Random Subspace Instance Selection (RSIS) is proposed for the robust design of MIL ensembles without any prior assumptions on the data structure and the proportion of instances in bags. First, instance selection probabilities are computed based on training data clustered in random subspaces. A pool of classifiers is then generated using the training subsets created with these selection probabilities. By using RSIS, MIL ensembles are more robust to many data distributions and noise, and are not adversely affected by the proportion of positive instances in positive bags because training instances are repeatedly selected in a probabilistic manner. Moreover, RSIS also allows the identification of positive instances on an individual basis, as required in many practical applications. Results obtained with several real-world and synthetic databases show the robustness of MIL ensembles designed with the proposed RSIS method over a range of witness rates, noisy features and data distributions compared to reference methods in the literature.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 58, October 2016, Pages 83–99
نویسندگان
, , , ,