کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
430282 687959 2012 21 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
P2P-MapReduce: Parallel data processing in dynamic Cloud environments
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
P2P-MapReduce: Parallel data processing in dynamic Cloud environments
چکیده انگلیسی

MapReduce is a programming model for parallel data processing widely used in Cloud computing environments. Current MapReduce implementations are based on centralized master-slave architectures that do not cope well with dynamic Cloud infrastructures, like a Cloud of clouds, in which nodes may join and leave the network at high rates. We have designed an adaptive MapReduce framework, called P2P-MapReduce, which exploits a peer-to-peer model to manage node churn, master failures, and job recovery in a decentralized but effective way, so as to provide a more reliable MapReduce middleware that can be effectively exploited in dynamic Cloud infrastructures. This paper describes the P2P-MapReduce system providing a detailed description of its basic mechanisms, a prototype implementation, and an extensive performance evaluation in different network scenarios. The performance results confirm the good fault tolerance level provided by the P2P-MapReduce framework compared to a centralized implementation of MapReduce, as well as its limited impact in terms of network overhead.


► P2P-MapReduce, an adaptive MapReduce framework to manage node churn and master failures
► A reliable MapReduce middleware to be effectively used in dynamic Cloud infrastructures
► Performance results confirm high fault tolerance level of the P2P-MapReduce framework
► A software framework for job recovery for large-scale Cloud computing.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Computer and System Sciences - Volume 78, Issue 5, September 2012, Pages 1382–1402
نویسندگان
, , ,