کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
431489 688560 2014 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Google hostload prediction based on Bayesian model with optimized feature combination
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Google hostload prediction based on Bayesian model with optimized feature combination
چکیده انگلیسی


• We devise an exponentially segmented pattern model for the hostload prediction.
• We devise a Bayes method and exploit 10 features to find the best-fit combination.
• We evaluate the Bayes method and 8 other well-known load prediction methods.
• The experiment is based on Google trace with over 10 k hosts and millions of jobs.
• The pattern prediction with Bayes method has much higher precision than others.

We design a novel prediction method with Bayes model to predict a load fluctuation pattern over a long-term interval, in the context of Google data centers. We exploit a set of features that capture the expectation, trend, stability and patterns of recent host loads. We also investigate the correlations among these features and explore the most effective combinations of features with various training periods. All of the prediction methods are evaluated using Google trace with 10,000+ heterogeneous hosts. Experiments show that our Bayes method improves the long-term load prediction accuracy by 5.6%–50%, compared to other state-of-the-art methods based on moving average, auto-regression, and/or noise filters. Mean squared error of pattern prediction with Bayes method can be approximately limited in [10−8,10−5][10−8,10−5]. Through a load balancing scenario, we confirm the precision of pattern prediction in finding a set of idlest/busiest hosts from among 10,000+ hosts can be improved by about 7% on average.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 74, Issue 1, January 2014, Pages 1820–1832
نویسندگان
, , ,