Article ID Journal Published Year Pages File Type
388091 Expert Systems with Applications 2012 13 Pages PDF
Abstract

Temporal regularity of itemset appearance can be regarded as an important criterion for measuring the interestingness of itemsets in several applications. A frequent itemset can be said to be regular-frequent in a database if it appears at a regular period. Therefore, the problem of mining a complete set of regular-frequent itemsets requires the specification of a support and a regularity threshold. However, in practice, it is often difficult for users to provide an appropriate support threshold. In addition, the use of a support threshold tends to produce a large number of regular-frequent itemsets and it might be better to ask for the number of desired results. We thus propose an efficient algorithm for mining top-k regular-frequent itemsets without setting a support threshold. Based on database partitioning and support estimation techniques, the proposed algorithm also uses a best-first search strategy with only one database scan. We then compare our algorithm with the state-of-the-art algorithms for mining top-k regular-frequent itemsets. Our experimental studies on both synthetic and real data show that our proposal achieves high performance for small and large values of k.

► The number of desired regular-frequent itemsets is specificable. ► A top-k regular-frequent itemsets mining algorithm is proposed. ► A database partitioning technique is applied. ► A support estimation technique is introduced. ► The proposed algorithm achieves high performance for sparse datasets.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,