کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
712146 1461146 2014 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A Control Approach for Performance of Big Data Systems
ترجمه فارسی عنوان
یک روش کنترل برای عملکرد سیستم های اطلاعات بزرگ
کلمات کلیدی
رد تراکم، سیستم های کنترل خطی، کنترل برای کامپیوتر، پردازش ابری، اطلاعات بزرگ
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مکانیک محاسباتی
چکیده انگلیسی

We are at the dawn of a huge data explosion therefore companies have fast growing amounts of data to process. For this purpose Google developed MapReduce, a parallel programming paradigm which is slowly becoming the de facto tool for Big Data analytics. Although to some extent its use is already wide-spread in the industry, ensuring performance constraints for such a complex system poses great challenges and its management requires a high level of expertise. This paper answers these challenges by providing the first autonomous controller that ensures service time constraints of a concurrent MapReduce workload. We develop the first dynamic model of a MapReduce cluster. Furthermore, PI feedback control is developed and implemented to ensure service time constraints. A feedforward controller is added to improve control response in the presence of disturbances, namely changes in the number of clients. The approach is validated online on a real 40 node MapReduce cluster, running a data intensive Business Intelligence workload. Our experiments demonstrate that the designed control is successful in assuring service time constraints.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: IFAC Proceedings Volumes - Volume 47, Issue 3, 2014, Pages 152–157
نویسندگان
, , , , ,