A load-balanced distributed parallel mining algorithm

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
384302	660843	2010	6 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Frequent Patterns - الگوهای مکرر Data mining - داده‌کاوی association rules - قوانین وابستگی Cluster computing - محاسبات خوشه ای Parallel and distributed processing - پردازش موازی و توزیع شده

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

A load-balanced distributed parallel mining algorithm

چکیده انگلیسی

Due to the exponential growth in worldwide information, companies have to deal with an ever growing amount of digital information. One of the most important challenges for data mining is quickly and correctly finding the relationship among data. The Apriori algorithm has been the most popular technique in finding frequent patterns. However, when applying this method, a database has to be scanned many times to calculate the counts of a huge number of candidate itemsets. Parallel and distributed computing is an effective strategy for accelerating the mining process. In this paper, the Distributed Parallel Apriori (DPA) algorithm is proposed as a solution to this problem. In the proposed method, metadata are stored in the form of Transaction Identifiers (TIDs), such that only a single scan to the database is needed. The approach also takes the factor of itemset counts into consideration, thus generating a balanced workload among processors and reducing processor idle time. Experiments on a PC cluster with 16 computing nodes are also made to show the performance of the proposed approach and compare it with some other parallel mining algorithms. The experimental results show that the proposed approach outperforms the others, especially while the minimum supports are low.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 37, Issue 3, 15 March 2010, Pages 2459–2464

نویسندگان

Kun-Ming Yu, Jiayi Zhou, Tzung-Pei Hong, Jia-Ling Zhou,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A load-balanced distributed parallel mining algorithm

دسترسی سریع

ارتباط

English Website