کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
433062 | 689225 | 2012 | 11 صفحه PDF | دانلود رایگان |

In a Grid computing system, many distributed scientific and engineering applications often require multi-institutional collaboration, large-scale resource sharing, wide-area communication, etc. Applications executing in such systems inevitably encounter different types of failures such as hardware failure, program failure, and storage failure. One way of taking failures into account is to employ a reliable scheduling algorithm. However, most existing Grid scheduling algorithms do not adequately consider the reliability requirements of an application. In recognition of this problem, we design a hierarchical reliability-driven scheduling architecture that includes both a local scheduler and a global scheduler. The local scheduler aims to effectively measure task reliability of an application in a Grid virtual node and incorporate the precedence constrained tasks’ reliability overhead into a heuristic scheduling algorithm. In the global scheduler, we propose a hierarchical reliability-driven scheduling algorithm based on quantitative evaluation of independent application reliability. Our experiments, based on both randomly generated graphs and the graphs of some real applications, show that our hierarchical scheduling algorithm performs much better than the existing scheduling algorithms in terms of system reliability, schedule length, and speedup.
► This paper designs a hierarchical reliability-driven scheduling architecture in Grid.
► This paper proposes a Grid VN reliability analysis model.
► We incorporate the task’s reliability overhead into a local scheduling algorithm.
► This paper proposes a Grid application’s reliability driven scheduling algorithm.
► The comparison studies show that our algorithm has good performance.
Journal: Journal of Parallel and Distributed Computing - Volume 72, Issue 4, April 2012, Pages 525–535