کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
524055 868548 2014 21 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
X10-FT: Transparent fault tolerance for APGAS language and runtime
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
X10-FT: Transparent fault tolerance for APGAS language and runtime
چکیده انگلیسی


• We make the first attempt to add fault tolerance support to the AGPAS programming model.
• Leverage the great advances in distributed systems such like DFS and Paxos in X10-FT.
• The X10-FT framework is transparent to the programmers in most cases by using the X10 language constructs.
• We implement a prototype. The evaluation of four practical benchmarks shows that X10-FT has modest overhead.

The asynchronous partitioned global address space (APGAS) model is a programming model aiming at unifying programming on multicore and clusters, with good productivity. However, it currently lacks support for fault tolerance (FT) such that a single transient failure may render hours to months of computation useless.In this paper, we thoroughly analyze the feasibility of providing fault tolerance for APGAS model and make the first attempt to add fault tolerance support to an APGAS language called X10. Based on the analysis, we design and implement a fault-tolerance framework called X10-FT that leverages renowned techniques in distributed systems like distributed file systems and Paxos, as well as specific solutions based on the characteristics of the APGAS model to make checkpoints and consensus. This allows the system to transparently handle machine failures at different granularities. Using the features of the APGAS model, we extend the X10 compiler to automatically locate execution points to checkpoint program states without any intervention from programmers. Evaluation using a set of benchmarks shows that the cost for fault tolerance is modest.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 40, Issue 2, February 2014, Pages 136–156
نویسندگان
, , , ,