A Tabu search based clustering algorithm and its parallel implementation on Spark

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6904135	1446997	2018	29 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Spark - جرقه Tabu search - جستجوی ممنوع یا تابو سرچ Clustering - خوشه بندی Parallel computing - رایانش موازی، محاسبات موازی k-Means - میانگین ـ کی Big Data - کلان داده

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش صفحه اول مقاله

A Tabu search based clustering algorithm and its parallel implementation on Spark

چکیده انگلیسی

The well-known K-means clustering algorithm has been employed widely in different application domains ranging from data analytics to logistics applications. However, the K-means algorithm can be affected by factors such as the initial choice of centroids and can readily become trapped in a local optimum. In this paper, we propose an improved K-means clustering algorithm that is augmented by a Tabu Search strategy, and which is better adapted to meet the needs of big data applications. Our design focuses on enhancements to take advantage of parallel processing based on the Spark framework. Computational experiments demonstrate the superiority of our parallel Tabu Search based clustering algorithm over a widely used version of the K-means approach embodied in the parallel Spark MLlib system, comparing the algorithms in terms of scalability, accuracy, and effectiveness.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 63, February 2018, Pages 97-109

نویسندگان

Yinhao Lu, Buyang Cao, Cesar Rego, Fred Glover,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A Tabu search based clustering algorithm and its parallel implementation on Spark

دسترسی سریع

ارتباط

English Website