Article code: 430789
Journal code: 688147
Publication year: 2007
English article: 12-page PDF
Full text: free download
English title of the ISI article
Towards highly available and scalable high performance clusters
Related topics
Engineering and Basic Sciences, Computer Engineering, Computational Theory and Mathematics
English abstract

In recent years, we have witnessed a growing interest in high performance computing (HPC) using clusters of workstations. This growth has made it affordable for individuals to have exclusive access to their own supercomputers. However, one of the challenges in a clustered environment is to keep system failures to a minimum and to achieve the highest possible level of system availability. High-Availability (HA) computing attempts to avoid the problems of unexpected failures through active redundancy and preemptive measures. Since the prices of hardware components are dropping significantly, we propose to combine the HPC and HA concepts and lay out the design of an HA–HPC cluster, considering all possible measures. In particular, we explore the hardware and management layers of the HA–HPC cluster design, and present a more focused study of the parallel-applications layer (i.e. FT-MPI implementations). Our findings show that combining the HPC and HA architectures is feasible, yielding an HA cluster that can be used for high performance computing.

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Computer and System Sciences - Volume 73, Issue 8, December 2007, Pages 1240-1251