Efficient algorithms for mining maximal high utility itemsets from data streams with different models

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
387315	660900	2012	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Data Stream Mining - معادن جریان داده Utility Mining - معدنکاری سودمند

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Efficient algorithms for mining maximal high utility itemsets from data streams with different models

چکیده انگلیسی

Data stream mining is an emerging research topic in the data mining field. Finding frequent itemsets is one of the most important tasks in data stream mining with wide applications like online e-business and web click-stream analysis. However, two main problems existed in relevant studies: (1) The utilities (e.g., importance or profits) of items are not considered. Actual utilities of patterns cannot be reflected in frequent itemsets. (2) Existing utility mining methods produce too many patterns and this makes it difficult for the users to filter useful patterns among the huge set of patterns. In view of this, in this paper we propose a novel framework, named GUIDE (Generation of maximal high Utility Itemsets from Data strEams), to find maximal high utility itemsets from data streams with different models, i.e., landmark, sliding window and time fading models. The proposed structure, named MUI-Tree (Maximal high Utility Itemset Tree), maintains essential information for the mining processes and the proposed strategies further facilitates the performance of GUIDE. Main contributions of this paper are as follows: (1) To the best of our knowledge, this is the first work on mining the compact form of high utility patterns from data streams; (2) GUIDE is an effective one-pass framework which meets the requirements of data stream mining; (3) GUIDE generates novel patterns which are not only high utility but also maximal, which provide compact and insightful hidden information in the data streams. Experimental results show that our approach outperforms the state-of-the-art algorithms under various conditions in data stream environments on different models.

► We propose the topic of finding maximal high utility itemsets from data streams.
► The proposed one-pass framework, i.e., GUIDE, fits to limitations of data streams.
► Three algorithms are proposed for landmark, sliding window and time fading models.
► The proposed data structure and strategies facilitate the performance of GUIDE.
► The found patterns provide compact and insightful information in the data streams.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 39, Issue 17, 1 December 2012, Pages 12947–12960

نویسندگان

Bai-En Shie, Philip S. Yu, Vincent S. Tseng,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Efficient algorithms for mining maximal high utility itemsets from data streams with different models

دسترسی سریع

ارتباط

English Website