کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6855247 1437610 2018 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Closed frequent similar pattern mining: Reducing the number of frequent similar patterns without information loss
ترجمه فارسی عنوان
معدن الگوی مشابه مکرر بسته: کاهش تعداد الگوهای مشابه مکرر بدون از دست دادن اطلاعات
کلمات کلیدی
داده کاوی، الگوهای مکرر، داده های مختلط، توابع مشابهی، بستن پایین،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Frequent pattern mining is considered a key task to discover useful information. Despite the quality of solutions given by frequent pattern mining algorithms, most of them face the challenge of how to reduce the number of frequent patterns without information loss. Frequent itemset mining addresses this problem by discovering a reduced set of frequent itemsets, named closed frequent itemsets, from which the entire frequent pattern set can be recovered. However, for frequent similar pattern mining, where the number of patterns is even larger than for Frequent itemset mining, this problem has not been addressed yet. In this paper, we introduce the concept of closed frequent similar pattern mining to discover a reduced set of frequent similar patterns without information loss. Additionally, a novel closed frequent similar pattern mining algorithm, named CFSP-Miner, is proposed. The algorithm discovers frequent patterns by traversing a tree that contains all the closed frequent similar patterns. To do this efficiently, several lemmas to prune the search space are introduced and proven. The results show that CFSP-Miner is more efficient than the state-of-the-art frequent similar pattern mining algorithms, except in cases where the number of frequent similar patterns and closed frequent similar patterns are almost equal. However, CFSP-Miner is able to find the closed similar patterns, yielding a reduced size of the discovered frequent similar pattern set without information loss. Also, CFSP-Miner shows good scalability while maintaining an acceptable runtime performance.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 96, 15 April 2018, Pages 271-283
نویسندگان
, , , , , ,