Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6873099 | Future Generation Computer Systems | 2018 | 19 Pages |
Abstract
Big data analysis jobs on clouds are gaining more and more popularity in recent years. It is critical but challenging to pick the right configuration for an incoming job, since the configuration space is too large, and the relationship between allocated resources and job performance is not deterministic. In this paper, we propose SERAC3 to allocate resources smartly and economically for big data clusters in community clouds. SERAC3 is a system that can automatically extract representative workloads from incoming big data analysis jobs, smartly decide an optimal configuration for each job, and adjust its assigning strategy in a quasi-realtime mode. With experiments on a community cloud built on OpenStack, we show that on average, SERAC3 can smartly select a configuration within 2.2% of the exact optimal one, while saving about 80.1% search cost compared to the exhaustive search.
Keywords
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics
Authors
Junnan Li, Zhihui Lu, Wei Zhang, Jie Wu, Hao Qiang, Bo Li, Patrick C.K. Hung,