Mining top-k regular-frequent itemsets using database partitioning and support estimation

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
388091	660916	2012	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Data mining - داده‌کاوی Association rule - قانون انجمن Frequent Itemset - مؤسسه مکرر

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Mining top-k regular-frequent itemsets using database partitioning and support estimation

چکیده انگلیسی

Temporal regularity of itemset appearance can be regarded as an important criterion for measuring the interestingness of itemsets in several applications. A frequent itemset can be said to be regular-frequent in a database if it appears at a regular period. Therefore, the problem of mining a complete set of regular-frequent itemsets requires the specification of a support and a regularity threshold. However, in practice, it is often difficult for users to provide an appropriate support threshold. In addition, the use of a support threshold tends to produce a large number of regular-frequent itemsets and it might be better to ask for the number of desired results. We thus propose an efficient algorithm for mining top-k regular-frequent itemsets without setting a support threshold. Based on database partitioning and support estimation techniques, the proposed algorithm also uses a best-first search strategy with only one database scan. We then compare our algorithm with the state-of-the-art algorithms for mining top-k regular-frequent itemsets. Our experimental studies on both synthetic and real data show that our proposal achieves high performance for small and large values of k.

► The number of desired regular-frequent itemsets is specificable.
► A top-k regular-frequent itemsets mining algorithm is proposed.
► A database partitioning technique is applied.
► A support estimation technique is introduced.
► The proposed algorithm achieves high performance for sparse datasets.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 39, Issue 2, 1 February 2012, Pages 1924–1936

نویسندگان

Komate Amphawan, Philippe Lenca, Athasit Surarerks,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Mining top-k regular-frequent itemsets using database partitioning and support estimation

دسترسی سریع

ارتباط

English Website