Article code Journal code Publication year English article Full-text version
241924 501790 2016 18-page PDF Free download
English title of the ISI article
Fast algorithms for mining high-utility itemsets with various discount strategies
Keywords
High-utility itemsets; Discount strategies; Downward closure property; Pruning strategies; PNU-list
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

In recent years, mining high-utility itemsets (HUIs) has emerged as a key topic in data mining. It consists of discovering sets of items that generate a high profit in a transactional database by considering both the purchase quantities and the unit profits of items. Many algorithms have been proposed for this task. However, most of them make the unrealistic assumption that the unit profits of items remain unchanged over time. In real life, the profit of an item or itemset varies as a function of cost prices, sales prices and sales strategies. Recently, a three-phase algorithm has been proposed to mine HUIs while considering that each item may have different discount strategies. However, the complete set of HUIs cannot be retrieved based on the traditional transaction-weighted utility (TWU) model with its defined discount strategies. Moreover, that algorithm suffers from the well-known drawbacks of Apriori-based algorithms, such as maintaining a huge number of candidates in memory and repeatedly performing time-consuming database scans. In this paper, the HUI-DTP algorithm for mining HUIs while considering the discount strategies of items is introduced. HUI-DTP is designed as a two-phase algorithm that mines the complete set of HUIs based on a novel downward closure property and a vertical TID-list structure. Furthermore, HUI-DMiner is an algorithm that relies on a compact data structure (Positive-and-Negative Utility-list, PNU-list) and the properties of two new pruning strategies to efficiently discover HUIs without candidate generation, while considerably reducing the size of the search space. Moreover, a strategy named Estimated Utility Co-occurrence Strategy, which stores the relationships between 2-itemsets, is also applied in the improved HUI-DEMiner algorithm to speed up computation. An extensive experimental study carried out on several real-life datasets shows that the proposed algorithms outperform the previous best algorithm in terms of runtime, memory consumption and scalability.
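
To make the task in the abstract concrete, the following is a minimal brute-force sketch of high-utility itemset mining and of the standard TWU upper bound. It is not HUI-DTP, HUI-DMiner or HUI-DEMiner, and it does not model discount strategies: unit profits are assumed fixed. The toy database, the profit table and the function names are all hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 2, "b": 1},
    {"a": 1, "c": 3},
    {"a": 1, "b": 2, "c": 1},
]
# Hypothetical unit profits, assumed fixed (no discounts are modeled here).
unit_profit = {"a": 5, "b": 3, "c": 1}


def itemset_utility(itemset, transactions, unit_profit):
    """Sum quantity * unit profit over every transaction containing the whole itemset."""
    total = 0
    for t in transactions:
        if all(item in t for item in itemset):
            total += sum(t[item] * unit_profit[item] for item in itemset)
    return total


def twu(itemset, transactions, unit_profit):
    """Transaction-weighted utility: total utility of all transactions containing the itemset."""
    return sum(
        sum(q * unit_profit[item] for item, q in t.items())
        for t in transactions
        if all(item in t for item in itemset)
    )


def mine_huis(transactions, unit_profit, min_utility):
    """Enumerate every itemset and keep those whose utility reaches min_utility."""
    items = sorted({item for t in transactions for item in t})
    result = {}
    for k in range(1, len(items) + 1):
        for itemset in combinations(items, k):
            u = itemset_utility(itemset, transactions, unit_profit)
            if u >= min_utility:
                result[itemset] = u
    return result


huis = mine_huis(transactions, unit_profit, min_utility=14)
print(huis)
```

In this toy data the item b alone has utility 9, below the threshold of 14, yet the itemset {a, b} has utility 24. Utility is therefore not downward closed, which is why an overestimate such as TWU (25 for b here) is used for pruning instead of the utility itself.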

Publisher
Database: Elsevier - ScienceDirect
Journal: Advanced Engineering Informatics - Volume 30, Issue 2, April 2016, Pages 109–126
Authors
, , , , ,