Article code: 432699
Journal code: 689037
Publication year: 2014
English article: 13-page PDF
Title of the ISI article
Monitoring persistent items in the union of distributed streams
Keywords
Distributed streams, persistent items
Related topics
Engineering and Basic Sciences; Computer Engineering; Computational Theory and Mathematics
Abstract


• We consider identification of persistent items from a distributed stream.
• Persistent items have been found to be associated with network anomalies.
• We present a communication-efficient distributed algorithm for tracking persistent items.
• Proofs of correctness and communication complexity.
• Experimental results reinforcing the theory.

A persistent item in a stream is one that occurs regularly in the stream without necessarily contributing significantly to the volume of the stream. Persistent items are often associated with anomalies in network streams, such as botnet traffic and click fraud. While it is important to track persistent items in an online manner, it is challenging to zero in on such items in a massive distributed stream. We present the first communication-efficient distributed algorithms for tracking persistent items in a data stream whose elements are partitioned across many different sites. We consider both infinite-window and sliding-window settings, and present algorithms that can track persistent items approximately with a probabilistic guarantee on the approximation error. Our algorithms have a provably low communication cost and, with high probability, a low rate of false positives and false negatives. We present detailed results from an experimental evaluation that show the communication cost is small, and that the false positive and false negative rates are typically much lower than the theoretical guarantees.
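To make the notion of persistence concrete, the following Python sketch computes persistent items exactly over the union of several per-site streams. It is a naive centralized baseline, not the communication-efficient algorithm of the paper. The abstract does not state a formal definition, so the sketch assumes that time is divided into slots and that an item is persistent when it appears in at least a fraction `alpha` of the slots. The function name `persistent_items`, the parameters `num_slots` and `alpha`, and the sample data are all hypothetical.

```python
from collections import defaultdict


def persistent_items(site_streams, num_slots, alpha):
    """Return the items seen in at least alpha * num_slots distinct slots.

    site_streams: iterable of per-site streams, each an iterable of
    (slot, item) pairs. Slot-based persistence is an assumption here.
    """
    slots_seen = defaultdict(set)  # item -> set of slots in which it occurred
    for stream in site_streams:
        for slot, item in stream:
            # Repeated occurrences within one slot count only once,
            # so high volume alone does not make an item persistent.
            slots_seen[item].add(slot)
    threshold = alpha * num_slots
    return {item for item, slots in slots_seen.items() if len(slots) >= threshold}


# Hypothetical data: "burst" is frequent but confined to a single slot,
# while "bot" appears once per slot, spread across the two sites.
site_a = [(0, "bot"), (0, "burst"), (0, "burst"), (0, "burst"), (2, "bot")]
site_b = [(1, "bot"), (1, "web"), (3, "bot"), (3, "web")]

result = persistent_items([site_a, site_b], num_slots=4, alpha=0.75)
print(result)
```

In this toy input, "burst" contributes the most volume yet occupies only one slot, whereas "bot" occurs in every slot once the two sites are combined. This baseline ships every observation to one place, which is the communication cost a distributed tracking algorithm aims to avoid.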

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Parallel and Distributed Computing - Volume 74, Issue 11, November 2014, Pages 3115–3127