کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
484193 703257 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An Improved PrePost Algorithm for Frequent Pattern Mining with Hadoop on Cloud
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
An Improved PrePost Algorithm for Frequent Pattern Mining with Hadoop on Cloud
چکیده انگلیسی

Due to the advancement in internet technologies the volume of data is tremendously increasing day by day. The research is gaining importance in extracting valuable information from such huge amount of data. Many research works are done and various algorithms are proposed. The PrePost algorithm is one of well-known algorithms of frequent pattern mining. It is based on N-list data structure to mine frequent item-sets. But the performance of PrePost algorithm degrades when it comes to processing of large amount of data. Hadoop is very well known technique for processing such large amount of data. This paper proposes the Improved PrePost algorithm which combines the features of Hadoop in order to process large data efficiently. Efficiency of PrePost algorithm is enhanced by implementing compact PPC tree with the general tree method and finding frequent itemsets without generating candidate itemsets. An architecture of the Improved PrePost algorithm with public cloud is proposed. The results show that as dataset size is increased, the Improved PrePost algorithm gives 60% better performance.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 79, 2016, Pages 207-214