کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4943145 1437621 2017 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Semi-supervised model-based clustering with controlled clusters leakage
ترجمه فارسی عنوان
خوشه بندی مبتنی بر مدل نیمه نظارت شده با نشتی خوشه های کنترل شده
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
In this paper, we focus on finding clusters in partially categorized data sets. We propose a semi-supervised version of Gaussian mixture model, called C3L, which retrieves natural subgroups of given categories. In contrast to other semi-supervised models, C3L is parametrized by user-defined leakage level, which controls maximal inconsistency between initial categorization and resulting clustering. Our method can be implemented as a module in practical expert systems to detect clusters, which combine expert knowledge with true distribution of data. Moreover, it can be used for improving the results of less flexible clustering techniques, such as projection pursuit clustering. The paper presents extensive theoretical analysis of the model and fast algorithm for its efficient optimization. Experimental results show that C3L finds high quality clustering model, which can be applied in discovering meaningful groups in partially classified data.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 85, 1 November 2017, Pages 146-157
نویسندگان
, , ,