کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
425824 685921 2016 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Extending ττ-Lop to model concurrent MPI communications in multicore clusters
ترجمه فارسی عنوان
گسترش ττ-Lop به مدل ارتباطات MPI همزمان در خوشه های چندگانه
کلمات کلیدی
مدل های عملکرد موازی؛ الگوریتم های موازی؛ رابط عبور پیام؛ تجزیه و تحلیل عملکرد؛ خوشه های چندگانه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی


• We present an extension of the ττ-Lop performance model for multicore clusters.
• The ττ-Lop goal is to help in the design and optimization of parallel algorithms.
• It is applied to collective algorithms in mainstream MPI implementations.
• The ττ-Lop model is compared to other well known and established models.
• A methodology is described for the measure of the parameters of the model.

Achieving optimal performance of MPI applications on current multi-core architectures, composed of multiple shared communication channels and deep memory hierarchies, is not trivial. Formal analysis using parallel performance models allows one to depict the underlying behavior of the algorithms and their communication complexities, with the aims of estimating their cost and improving their performance.LogGP model was initially conceived to predict the cost of algorithms in mono-processor clusters based on point-to-point transmissions with network latency and bandwidth based parameters. It remains as the representative model, with multiple extensions for handling high performance networks, covering particular contention cases, channels hierarchies or protocol costs. These very specific branches lead LogGP to partially lose its initial abstract modeling purpose.More recent lognlognP represents a point-to-point transmission as a sequence of implicit transfers or data movements. Nevertheless, similar to LogGP, it models an algorithm in a parallel architecture as a sequence of message transmissions, an approach inefficient to model algorithms more advanced than simple tree-based one, as we will show in this work.In this paper, ττ–Lop model is extended to multi-core clusters and compared to previous models. It demonstrates the ability to predict the cost of advanced algorithms and mechanisms used by mainstream MPI implementations, such as MPICH or Open MPI, with high accuracy. ττ–Lop is based on the concept of concurrent transfers, and applies it to meaningfully represent the behavior of parallel algorithms in complex platforms with hierarchical shared communication channels, taking into account the effects of contention and deployment of processes on the processors. In addition, an exhaustive and reproducible methodology for measuring the parameters of the model is described.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Future Generation Computer Systems - Volume 61, August 2016, Pages 66–82
نویسندگان
, , ,