کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6902234 1446500 2017 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Frequent pattern mining on stream data using Hadoop CanTree-GTree
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
Frequent pattern mining on stream data using Hadoop CanTree-GTree
چکیده انگلیسی
The need for knowledge discovery from real-time stream data is continuously increasing nowadays and processing of transactions for mining patterns needs efficient data structures and algorithms. We propose a time-efficient Hadoop CanTree-GTree algorithm, using Apache Hadoop. This algorithm mines the complete frequent item sets (patterns) from real time transactions, by utilizing the sliding window technique. These are used to mine for closed frequent item sets and then, association rules are derived. It makes use of two data structures - CanTree and GTree. The results show that the Hadoop implementation of the algorithm performs 5 times better than in Java.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 115, 2017, Pages 266-273
نویسندگان
, , , ,