کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
536543 870551 2011 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets
چکیده انگلیسی

Almost all subspace clustering algorithms proposed so far are designed for numeric datasets. In this paper, we present a k-means type clustering algorithm that finds clusters in data subspaces in mixed numeric and categorical datasets. In this method, we compute attributes contribution to different clusters. We propose a new cost function for a k-means type algorithm. One of the advantages of this algorithm is its complexity which is linear with respect to the number of the data points. This algorithm is also useful in describing the cluster formation in terms of attributes contribution to different clusters. The algorithm is tested on various synthetic and real datasets to show its effectiveness. The clustering results are explained by using attributes weights in the clusters. The clustering results are also compared with published results.

Research highlights
► We propose an algorithm for the subspace clustering of mixed datasets.
► This is a linear algorithm and gives information about the cluster formation.
► The experiments show its effectiveness on categorical datasets and on mixed datasets.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 32, Issue 7, 1 May 2011, Pages 1062–1069
نویسندگان
, ,