Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
461135 | Journal of Systems and Software | 2013 | 10 Pages |
In distributed systems, resource prediction is an important but difficult topic. In many cases, multiple prediction is needed rather than only performing prediction at a single future point in time. However, traditional approaches are not sufficient for multi-step-ahead prediction. We introduce a pattern fusion model to predict multi-step-ahead CPU loads. In this model, similar patterns are first extracted from the historical data via calculating Euclidean distance and fluctuation pattern distance between historical patterns and current sequence. For a given pattern length, multiple similar patterns of this length can often be found and each of them can produce a prediction. We also propose a pattern weight strategy to merge these prediction. Finally, a machine learning algorithm is used to combine the prediction results obtained from different length pattern sets dynamically. Empirical results on four real-world production servers show that this approach achieves higher accuracy on average than existing approaches for multi-step-ahead prediction.
► A resource prediction method with pattern fusion algorithm in distributed system is novel. ► Our method provides multi-step-ahead algorithms to improve the prediction efficiency. ► The experiment performs accurate results and low prediction error. ► Our method is tested by real datasets of four clusters and is compared with other algorithms. ► This model provides support for resource management and scheduling in distributed system and cloud computing.