Article ID Journal Published Year Pages File Type
458338 Journal of Systems and Software 2016 13 Pages PDF
Abstract

•A framework to manage interactive MapReduce applications with deadline constraint.•A dispatcher to assign jobs to resources considering blocking and data-locality.•A dynamic power management for MapReduce tasks to improve run-time energy efficiency.•A schedulability test to ensure that all MapReduce tasks meet the timing constraints.

MapReduce is widely used in cloud applications for large-scale data processing. The increasing number of interactive cloud applications has led to an increasing need for MapReduce real-time scheduling. Most MapReduce applications are data-oriented and nonpreemptively executed. Therefore, the problem of MapReduce real-time scheduling is complicated because of the trade-off between run-time blocking for nonpreemptive execution and data-locality. This paper proposes a data-locality-aware MapReduce real-time scheduling framework for guaranteeing quality of service for interactive MapReduce applications. A scheduler and dispatcher that can be used for scheduling two-phase MapReduce jobs and for assigning jobs to computing resources are presented, and the dispatcher enable the consideration of blocking and data-locality. Furthermore, dynamic power management for run-time energy saving is discussed. Finally, the proposed methodology is evaluated by considering synthetic workloads, and a comparative study of different scheduling algorithms is conducted.

Related Topics
Physical Sciences and Engineering Computer Science Computer Networks and Communications
Authors
, ,