کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6864858 1439552 2018 26 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Geometric mean metric learning for partial label data
ترجمه فارسی عنوان
میانگین یادگیری متریک برای داده های اطلاعات جزئی
کلمات کلیدی
یادگیری برچسب جزئی یادگیری متریک، میانگین هندسی وزنی، اطلاعات ضعیف تحت نظارت،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Partial label learning (PLL) is a new weakly supervised learning framework that addresses the classification problems, where the true label of each training sample is concealed in a set of candidate labels. To learn from such weakly supervised training data, the key is to disambiguate the ambiguous labeling information. Because it is difficult to address by only focusing on the manipulation in the label space, manifold structure among training data in the feature space has gradually been exploited simultaneously to facilitate the disambiguation process by researchers in recent years. However, the manifold structure is commonly analyzed under an assumption that the samples close to each other in the feature space will share identical labels in the label space, which may be not correct in many real-world problems. In this paper, geometric mean metric learning approach is employed to learn a distance metric for PLL problems such that can maintain the aforementioned assumption correct in as many situations as possible. It is significantly more challenging than the conventional setup of distance metric learning because it is difficult to precisely identify whether a pair of training samples belong to the same class. We propose an alternative approach in which each training sample and its neighbor with shared candidate label are taken as a similarity pair, and each training sample and its neighbor without shared candidate label are taken as a dissimilarity pair. Considering that two samples with shared candidate label do not necessarily come from the same class, a weight is placed on each similarity pair. The experimental results on twenty four controlled UCI data sets and six real-world PLL problems show the proposed distance metric learning approach can be used as a front end of both the PLL algorithms exploiting the manifold structure among training data and other existing distance-based PLL algorithms to significantly improve their performance.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 275, 31 January 2018, Pages 394-402
نویسندگان
, ,