دانلود رایگان مقاله: گروه بر اساس خوشه بندی فازی خشن برای داده های طبقه بندی شده است

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
404943	677466	2015	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Ensemble based rough fuzzy clustering for categorical data

ترجمه فارسی عنوان

گروه بر اساس خوشه بندی فازی خشن برای داده های طبقه بندی شده است

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

داده های طبقه بندی شده گروهی فراگیری ماشین، خوشه فازی خفیف، معیار شباهت / ناهماهنگی، آزمون آماری

statistical test - آزمون آماری Categorical data - داده های طبقه بندی شده Ensemble - گروهی Machine learning - یادگیری ماشین

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

گروه بر اساس خوشه بندی فازی خشن برای داده های طبقه بندی شده است

چکیده انگلیسی

Categorical data is different from continuous data, where the values of attribute do not follow any natural ordering. Moreover, inherent complexities like uncertainty, vagueness and overlapping among clusters make the analysis of real life categorical data set more difficult. Recent literature review shows that the well-known categorical data clustering techniques are using different similarity/dissimilarity measures to tackle the inherent complexities of the categorical attribute values. Generally, it is hard to find single method and cluster validity measure that can be used as perfect or standard for all kinds of categorical data sets. Hence, in this paper first, a clustering method for categorical data is proposed by fusing rough set and fuzzy set theories. Subsequently, an ensemble based framework is designed with the recently proposed similarity/dissimilarity measures in order to have better clustering results for different types of categorical data sets. For this purpose, the proposed rough fuzzy clustering method is used sequentially with the integration of different measures to evolve the clustering solutions. Using consensus of these solutions, pure classified, semi rough and pure rough points are identified. Thereafter, machine learning method, called Random Forest, is used in incremental way to classify the semi and pure rough points using pure classified points to yield better clustering results. The performance of the proposed method has been demonstrated in comparison with several other recently developed clustering methods. Additionally, the selection of Random Forest in the proposed framework is justified by comparing its performance with other well-known machine learning methods like K-Nearest Neighbor and Support Vector Machine. Ten categorical data sets are used for the experimental purpose. Finally, statistical significance test has been conducted to judge the superiority of the results.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 77, March 2015, Pages 114–127

نویسندگان

Indrajit Saha, Jnanendra Prasad Sarkar, Ujjwal Maulik,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : گروه بر اساس خوشه بندی فازی خشن برای داده های طبقه بندی شده است

دسترسی سریع

ارتباط

English Website