کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6856863 1437971 2018 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Finding the hottest item in data streams
ترجمه فارسی عنوان
پیدا کردن داغترین مورد در جریان داده ها
کلمات کلیدی
الگوریتم آنلاین، داغترین فاصله، جریان مورد،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
We study a problem of finding the hottest item interval in a data stream, where the hotness of an item over an interval is determined by its average frequency. Finding the hottest item interval is particularly helpful in business promotions, such as monitoring the peak sales records, finding the hottest period in an online game, digging the highest click rate of an online music, etc. Existing work focus on finding the most frequent item over a fixed length interval. However, these solutions cannot return the hottest interval since the best length (i.e., maximizing the average frequency) is unknown in advance. To discover the hottest item interval, a straightforward solution is to calculate the average frequencies of items for every possible interval length, which is too costly for stream applications. To efficiently compute the hottest item interval, we propose an algorithm that employs the arrival timestamps of items and reduce the search space by three pruning strategies. Extensive experiments show that the proposed algorithms can efficiently discover the hottest item interval on both real and synthetic datasets.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volumes 430–431, March 2018, Pages 314-330
نویسندگان
, , , , , ,