کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4944146 1437981 2017 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem
ترجمه فارسی عنوان
ترکیب الگوریتم های آپروری اکتشافی و الهام گرفته از زیست محیطی برای حل معادلات مکرر مجموعه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Exact approaches to Frequent Itemsets Mining (FIM) are characterised by poor runtime performance when dealing with large database instances. Several FIM bio-inspired approaches have been proposed to overcome this issue. These are considerably more efficient from the point of view of runtime performance, but they still yield poor quality solutions. The quality of the solution, i.e., the number of frequent itemsets discovered, can be increased by improving the randomised search of the solutions space considering intrinsic features of the FIM problem. This paper proposes a new framework for FIM bio-inspired approaches that considers the recursive property of frequent itemsets, i.e., the same feature exploited by the Apriori exact heuristic, in the search of the solution space. We define two new approaches to FIM, namely GA-Apriori and PSO-Apriori, based on the proposed framework, which use genetic algorithms and particle swarm optimisation, respectively. Extensive experiments on synthetic and real database instances show that the proposed approaches outperform other bio-inspired ones in terms of runtime performance. The results also reveal that the performance of PSO-Apriori is comparable to the one of exact approaches Apriori and FPGrowth in respect of the quality of solutions found. We also show that PSO-Apriori outperforms the recently developed BATFIM algorithm when dealing with very large database instances.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 420, December 2017, Pages 1-15
نویسندگان
, ,