Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4945147 | Information Systems | 2017 | 16 Pages |
Abstract
Thanks to efficient enumeration and pruning strategies, TopPI avoids the search space explosion induced by mining low support itemsets. We show how TopPI can be parallelized on multi-cores and distributed on Hadoop clusters. Our experiments on datasets with different characteristics show the superiority of TopPI when compared to standard top-k solutions, and to Parallel FP-Growth, its closest competitor.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
V. Leroy, M. Kirchgessner, A. Termier, S. Amer-Yahia,