کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
407007 678124 2014 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Cluster validation in problems with increasing dimensionality and unbalanced clusters
ترجمه فارسی عنوان
اعتبارسنجی خوشه ای در مشکلات با افزایش ابعاد و خوشه های نامتعادل
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Cluster validation methods provide measures to evaluate the quality of a clustering partition on a given data set, and to determine the correct number of clusters. Recently, a new set of validation techniques based on the clusters' negentropy has been introduced. Negentropy-based cluster validation favors data partitions into compact clusters which are not strongly overlapped. Its evaluation is quite simple and it has been shown to perform better than other state of the art techniques. However, like many other cluster validation approaches, it presents problems when validating partitions where some regions contain only a few data points. Different heuristics have been proposed to cope with this problem, which are systematically analyzed in this paper. We study the performance of AIC, BIC, and four negentropy-based validation approaches in synthetic clustering problems of increasing dimensionality, with unbalanced clusters and different degree of overlapping. Our results suggest that negentropy-based validation techniques outperform AIC and BIC when the ratio of the number of points to the dimension is not high, which is a very common situation in most real applications.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 123, 10 January 2014, Pages 33–39
نویسندگان
, , , , ,