Article ID Journal Published Year Pages File Type
461135 Journal of Systems and Software 2013 10 Pages PDF
Abstract

In distributed systems, resource prediction is an important but difficult topic. In many cases, multiple prediction is needed rather than only performing prediction at a single future point in time. However, traditional approaches are not sufficient for multi-step-ahead prediction. We introduce a pattern fusion model to predict multi-step-ahead CPU loads. In this model, similar patterns are first extracted from the historical data via calculating Euclidean distance and fluctuation pattern distance between historical patterns and current sequence. For a given pattern length, multiple similar patterns of this length can often be found and each of them can produce a prediction. We also propose a pattern weight strategy to merge these prediction. Finally, a machine learning algorithm is used to combine the prediction results obtained from different length pattern sets dynamically. Empirical results on four real-world production servers show that this approach achieves higher accuracy on average than existing approaches for multi-step-ahead prediction.

► A resource prediction method with pattern fusion algorithm in distributed system is novel. ► Our method provides multi-step-ahead algorithms to improve the prediction efficiency. ► The experiment performs accurate results and low prediction error. ► Our method is tested by real datasets of four clusters and is compared with other algorithms. ► This model provides support for resource management and scheduling in distributed system and cloud computing.

Related Topics
Physical Sciences and Engineering Computer Science Computer Networks and Communications
Authors
, , , , ,