کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
396858 670608 2007 25 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
CFP-tree: A compact disk-based structure for storing and querying frequent itemsets
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
CFP-tree: A compact disk-based structure for storing and querying frequent itemsets
چکیده انگلیسی

Frequent itemset mining is an important problem in the data mining area with a wide range of applications. Many decision support systems need to support online interactive frequent itemset mining, which is a challenging task because frequent itemset mining is a computation intensive repetitive process. One solution is to precompute frequent itemsets. In this paper, we propose a compact disk-based data structure—CFP-tree to store precomputed frequent itemsets on a disk to support online mining requests. The CFP-tree structure effectively utilizes the redundancy in frequent itemsets to save space. The compressing ratio of a CFP-tree can be as high as several thousands or even higher. Efficient algorithms for retrieving frequent itemsets from a CFP-tree, as well as efficient algorithms to construct and maintain a CFP-tree, are developed. Our performance study demonstrates that with a CFP-tree, frequent itemset mining requests can be responded to promptly.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Systems - Volume 32, Issue 2, April 2007, Pages 295–319
نویسندگان
, , ,