Clustering large data with uncertainty

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
496100	862850	2013	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Clustering - خوشه بندی Data mining - داده‌کاوی Fuzzy sets - مجموعه‌های فازی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش صفحه اول مقاله

چکیده انگلیسی

A new algorithm is designed for handling fuzziness while mining large data. A new novel cost function weighted by fuzzy membership, is proposed in the framework of CLARANS. A new scalable approximation to the maximum number of neighbors, explored at each node, is developed; thus reducing the computational time for large data while eliminating the need for user-defined (heuristic) parameters in the existing equation. The goodness of the generated clusters is evaluated in terms of Xie–Beni validity index. Results demonstrate the superiority of the proposed algorithm, over both synthetic and real data sets, in terms of goodness of clustering. It is interesting to note that our algorithm always converges to the globally best values at the optimal number of partitions. Moreover compared to existing fuzzy algorithms, FCLARANS without scanning the whole dataset, searching small number of neighbors, is able to handle the uncertainty due to overlapping nature of the various partitions. This is the main motivation of fuzzification of the algorithm CLARANS.

Variation of threshold Ncross with the number of clusters c. Figure optionsDownload as PowerPoint slideHighlights
► A fuzzified version of the algorithm CLARANS is designed.
► The cost function is weighted by fuzzy membership in the framework of CLARANS.
► A new scalable approximation to the maximum number of neighbors, explored at each node.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 13, Issue 4, April 2013, Pages 1639–1645

نویسندگان

Sampreeti Ghosh, Sushmita Mitra,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Clustering large data with uncertainty

دسترسی سریع

ارتباط

English Website