Article ID Journal Published Year Pages File Type
382069 Expert Systems with Applications 2015 14 Pages PDF
Abstract

•An introduction of a new type of anomalies in computing system management.•An incremental algorithm to discover the contextual collective anomalies in real time.•A flexible three-stage framework to discover anomalies from multiple data streams.•Empirical evaluation through the real world scenario.

Identifying the anomalies is a critical task to maintain the uptime of the monitored distributed systems. For this reason, the trace data collected from real time monitors are often provided in form of streams for anomaly detection. Due to the dramatic increase of the scale of modern distributed systems, it is challenging to effectively and efficiently discover the anomalies from a voluminous amount of noisy and high-dimensional data streams. Moreover, the evolving of the system infrastructures brings new anomaly types that cannot be generalized as existing ones, making the existing anomaly detection solutions unavailable.To address these issues, in this paper, we introduce a new type of anomalies called contextual collective anomaly. Then we propose a framework to discover this type of anomaly over a collection of data streams in real time. A primary advantage of this solution is that it can accurately identify the anomalies by taking both the contextual information and the historical information of a data stream into consideration. Also, the proposed framework is designed in a way with a low computational cost, and is able to handle large-scale data streams. To demonstrate the effectiveness and efficiency of our proposed framework, we empirically validate it on a real world cluster.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,