کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
405301 677525 2011 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An initialization method to simultaneously find initial cluster centers and the number of clusters for clustering categorical data
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
An initialization method to simultaneously find initial cluster centers and the number of clusters for clustering categorical data
چکیده انگلیسی

The leading partitional clustering technique, k-modes, is one of the most computationally efficient clustering methods for categorical data. However, in the k-modes-type algorithms, the performance of their clustering depends on initial cluster centers and the number of clusters needs be known or given in advance. This paper proposes a novel initialization method for categorical data which is implemented to the k-modes-type algorithms. The proposed method can not only obtain the good initial cluster centers but also provide a criterion to find candidates for the number of clusters. The performance and scalability of the proposed method has been studied on real data sets. The experimental results illustrate that the proposed method is effective and can be applied to large data sets for its linear time complexity with respect to the number of data points.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 24, Issue 6, August 2011, Pages 785–795
نویسندگان
, , ,