Projected clustering for categorical datasets

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
535323	870340	2006	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

cluster validation - اعتبارسنجی خوشه Clustering algorithm - الگوریتم خوشه بندی Cluster validity index - شاخص اعتبار خوشه Unsupervised learning - یادگیری بدون نظارت

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش صفحه اول مقاله

Projected clustering for categorical datasets

چکیده انگلیسی

This paper deals with the problem of clustering categorical datasets. Categorical data typically suffer from limited measuring levels and exhibit sparsity in a space of very high dimension. Conventional dissimilarity measures are, therefore, inadequate. We propose a new clustering algorithm based on projected clustering. The proposed algorithm, although hierarchical in essence, avoids the characteristic error propagation through reassignment and deletion of bad clusters. We also propose new indices for cluster validation in categorical datasets, an area that is almost unexplored. We present techniques for finding optimal number of clusters, and for initialization of centers of clusters. Experimental results demonstrate the effectiveness of the proposed clustering algorithm. The cluster validation for categorical datasets is also shown to be quite efficient.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 27, Issue 12, September 2006, Pages 1405–1417

نویسندگان

Minho Kim, R.S. Ramakrishna,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Projected clustering for categorical datasets

دسترسی سریع

ارتباط

English Website