A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
535681	870360	2007	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Categorical data - داده های طبقه بندی شده Co-occurrences - رویدادهای مشترک Similarity - شباهت Unsupervised learning - یادگیری بدون نظارت

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش صفحه اول مقاله

A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set

چکیده انگلیسی

Computation of similarity between categorical data objects in unsupervised learning is an important data mining problem. We propose a method to compute distance between two attribute values of same attribute for unsupervised learning. This approach is based on the fact that similarity of two attribute values is dependent on their relationship with other attributes. Computational cost of this method is linear with respect to number of data objects in data set. To see the effectiveness of our proposed distance measure, we use proposed distance measure with K-mode clustering algorithm to cluster various categorical data sets. Significant improvement in clustering accuracy is observed as compared to clustering results obtained using traditional K-mode clustering algorithm.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 28, Issue 1, 1 January 2007, Pages 110–118

نویسندگان

Amir Ahmad, Lipika Dey,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set

دسترسی سریع

ارتباط

English Website