Article code: 4944453
Journal code: 1437994
Publication year: 2017
English article: 46 pages, PDF
Full-text version: Free download
English title of the ISI article
A fast and accurate algorithm for unsupervised clustering around centroids
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract
A centroid-based clustering algorithm is proposed that works in a totally unsupervised fashion and is significantly faster and more accurate than existing algorithms. The algorithm, named CLUBS+ (for CLustering Using Binary Splitting), achieves these results by combining features of hierarchical and partition-based algorithms. Thus, CLUBS+ consists of two major phases, i.e., a divisive phase and an agglomerative phase, each followed by a refinement phase. Each major phase consists of successive steps in which the samples are repartitioned using a criterion based on least quadratic distance. This criterion possesses unique analytical properties that are elucidated in the paper and exploited by the algorithm to achieve a very fast computation. The paper presents the results of extensive experiments, which confirm that the new algorithm is fast, impervious to noise, and produces results of better quality than other algorithms, such as BOOL, BIRCH, and k-means++, even when the analyst can determine the correct number of clusters, a very difficult task that CLUBS+ spares its users.
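To make the least-quadratic-distance criterion concrete, here is a minimal Python sketch of one binary splitting step. It is not the authors' CLUBS+ implementation: the abstract does not say how candidate splits are generated, so the axis-aligned cuts, the function names `ssq` and `best_binary_split`, and the sample points are all assumptions made for illustration. The sketch relies on a standard identity: cutting a set of n points into parts of sizes n_l and n_r reduces the sum of squared distances to the centroids by (n_l * n_r / n) times the squared distance between the two centroids, so each candidate cut can be scored from running sums alone.

```python
from typing import Sequence, Tuple

Point = Sequence[float]


def ssq(points: Sequence[Point]) -> float:
    """Sum of squared Euclidean distances from each point to the centroid."""
    if not points:
        return 0.0
    n = len(points)
    total = 0.0
    for k in range(len(points[0])):
        s = sum(p[k] for p in points)
        sq = sum(p[k] * p[k] for p in points)
        # Identity: sum((x - mean)^2) == sum(x^2) - (sum(x))^2 / n
        total += sq - s * s / n
    return total


def best_binary_split(points: Sequence[Point]) -> Tuple[int, float, float]:
    """Return (axis, threshold, ssq_reduction) for the best axis-aligned cut.

    Hypothetical helper: returns axis == -1 when no cut reduces the SSQ.
    """
    n = len(points)
    if n < 2:
        return (-1, 0.0, 0.0)
    dims = len(points[0])
    totals = [sum(p[k] for p in points) for k in range(dims)]
    best = (-1, 0.0, 0.0)
    for axis in range(dims):
        order = sorted(points, key=lambda p: p[axis])
        left = [0.0] * dims  # running coordinate sums of the left part
        for i in range(1, n):
            for k in range(dims):
                left[k] += order[i - 1][k]
            if order[i][axis] == order[i - 1][axis]:
                continue  # no threshold separates equal coordinates
            n_l, n_r = i, n - i
            # Squared distance between the centroids of the two parts.
            dist2 = sum(
                (left[k] / n_l - (totals[k] - left[k]) / n_r) ** 2
                for k in range(dims)
            )
            gain = n_l * n_r / n * dist2
            if gain > best[2]:
                threshold = (order[i - 1][axis] + order[i][axis]) / 2.0
                best = (axis, threshold, gain)
    return best


if __name__ == "__main__":
    # Made-up sample: two well-separated groups of four points each.
    pts = [(0, 0), (0, 1), (1, 0), (1, 1),
           (10, 10), (10, 11), (11, 10), (11, 11)]
    print(best_binary_split(pts))
```

A divisive phase in this spirit would apply such a split repeatedly to the resulting parts until the reduction becomes negligible; the stopping rule, the agglomerative phase, and the refinement phases described in the abstract are not modeled here.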
Publisher
Database: Elsevier - ScienceDirect
Journal: Information Sciences - Volumes 400–401, August 2017, Pages 63-90
Authors
, , ,