Article code: 431496
Journal code: 688560
Publication year: 2014
English article: 16 pages, PDF
Full-text version: free download
English title of the ISI article
Fair scheduling of bag-of-tasks applications using distributed Lagrangian optimization
Persian translation of the title
برنامه ریزی عادلانه برنامه های کیسه های کاری با استفاده از بهینه سازی لاگرانژی توزیع شده
Keywords
Lagrangian optimization, steady-state scheduling, distributed scheduling, grid computing
Related topics
Engineering and basic sciences, computer engineering, computational theory and mathematics
English abstract


• A robust, fair, and optimal distributed scheduling algorithm for concurrent BoT applications with arbitrary CCR is proposed.
• Despite similarity, this context is more complicated than multi-path flow control.
• Non-trivial adaptations of Distributed Lagrangian Optimization are required.
• Experimental proof of convergence is given for large platforms.

Large-scale distributed systems typically comprise hundreds to millions of entities (applications, users, companies, universities) that have only a partial view of resources (computers, communication links). How to share such resources fairly and efficiently between entities in a distributed way has thus become a critical question.

Not all applications are suitable for execution on large-scale distributed computing platforms, but Bag-of-Tasks (BoT) applications are ideal candidates. Hence a large fraction of the jobs in workloads imposed on Grids consists of sequential applications submitted in the form of BoTs. Up until now, mainly simple mechanisms have been used to ensure a fair sharing of resources among these applications. Although these mechanisms have proved to be efficient for CPU-bound applications, they are known to be ineffective in the presence of network-bound applications.

A possible answer resorts to Lagrangian optimization and distributed gradient descent. Under certain conditions, the resource sharing problem can be formulated as a global optimization problem, which can be solved by a distributed, self-stabilizing supply-and-demand algorithm. In the last decade, this technique has been applied to design various network protocols (variants of TCP, multi-path network protocols, wireless network protocols) and even distributed algorithms for smart grids.

In this article, we explain how to use this technique for fairly scheduling concurrent BoT applications with arbitrary communication-to-computation ratio (CCR) on a Grid. However, application heterogeneity raises severe convergence and stability issues that did not appear in the previous contexts and need to be addressed by non-trivial modifications. The effectiveness of our proposal is assessed through an extensive set of complex and realistic simulations.
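
The supply-and-demand idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the article's algorithm: it is the textbook single-path dual gradient method, run in a single process, and it assumes a proportional-fairness objective (maximize the sum of log throughputs). The function name `fair_share`, the `usage` matrix, the capacities, and the step size are all hypothetical choices made for illustration. Each resource holds a price (its Lagrange multiplier) that it adjusts using only its own load, and each application picks its task rate using only the prices of the resources it consumes.

```python
def fair_share(usage, capacity, step=0.005, iterations=2000):
    """Dual gradient sketch (assumed objective: maximize sum_k log(x_k))
    subject to sum_k usage[r][k] * x_k <= capacity[r] for every resource r."""
    n_res, n_apps = len(capacity), len(usage[0])
    prices = [1.0] * n_res   # one Lagrange multiplier (price) per resource
    rates = [0.0] * n_apps   # tasks per time unit for each application
    for _ in range(iterations):
        # Demand side: each application reacts to the total price of one task.
        for k in range(n_apps):
            cost = sum(usage[r][k] * prices[r] for r in range(n_res))
            rates[k] = 1.0 / max(cost, 1e-9)  # maximizer of log(x) - cost * x
        # Supply side: each resource raises its price when overloaded
        # and lowers it (never below zero) when underused.
        for r in range(n_res):
            load = sum(usage[r][k] * rates[k] for k in range(n_apps))
            prices[r] = max(0.0, prices[r] + step * (load - capacity[r]))
    return rates, prices


# Hypothetical platform: one CPU and one network link, each of capacity 10.
# Application 0 is CPU-bound, application 1 is network-bound.
usage = [[1.0, 0.1],   # CPU units consumed per task, per application
         [0.1, 1.0]]   # link units consumed per task, per application
rates, prices = fair_share(usage, [10.0, 10.0])
print(rates, prices)
```

In this symmetric toy instance both resources end up saturated and both applications receive the same rate, 10 / 1.1 tasks per time unit. The step size was chosen by hand so that the iteration is stable here; with more heterogeneous applications a fixed step like this can oscillate or diverge, which is the kind of convergence issue the abstract says requires non-trivial modifications.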

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Parallel and Distributed Computing - Volume 74, Issue 1, January 2014, Pages 1914–1929
Authors
, , , ,