Adapting wave-front algorithms to efficiently utilize systems with deep communication hierarchies

Article code	Journal code	Publication year	English article
524680	868824	2011	12-page PDF

Keywords

Performance analysis; Hybrid systems; High performance computing; Performance modeling; Programming models

Related topics

Engineering and Basic Sciences; Computer Engineering; Computer Science Software

Abstract

Large-scale systems increasingly exhibit a differential between intra-chip and inter-chip communication performance, especially in hybrid systems using accelerators. Processor-cores on the same socket are able to communicate at lower latencies, and with higher bandwidths, than cores on different sockets, either within the same node or between nodes. A key challenge is to efficiently use this communication hierarchy and hence optimize performance. We consider here the class of applications that contains wave-front processing. In these applications, data can only be processed after their upstream neighbors have been processed. Similar dependencies result between processors, in which communication is required to pass boundary data downstream and whose cost is typically impacted by the slowest communication channel in use. In this work we develop a novel hierarchical wave-front approach that reduces the use of slower communications in the hierarchy, but at the cost of additional steps in the parallel computation and higher use of on-chip communications. This tradeoff is explored using a performance model. An implementation using the reverse-acceleration programming model on the petascale Roadrunner system demonstrates a 27% performance improvement at full system-scale on a kernel application. The approach is generally applicable to large-scale multi-core and accelerated systems where a differential in communication performance exists.

Research highlights
► We develop a novel implementation of wavefront algorithms for hybrid supercomputers.
► The use of slow communications is reduced but at the cost of increased compute steps.
► The tradeoff is explored for many system configurations using a performance model.
► An implementation on the petascale Roadrunner system demonstrates a 27% improvement.
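
The wave-front dependency described in the abstract (a cell can be processed only after its upstream neighbors) can be illustrated with a minimal Python sketch. This is not the paper's hierarchical algorithm or its performance model: the function name `wavefront_sweep`, the two-neighbor (west and north) dependency, and the averaging update are assumptions chosen purely for illustration.

```python
def wavefront_sweep(nx, ny, boundary=1.0):
    """Fill an nx-by-ny grid in wave-front order.

    Hypothetical example: each cell depends on its west and north
    neighbors; cells outside the grid take the value `boundary`.
    Returns the filled grid and the number of wave-front steps.
    """
    grid = [[0.0] * ny for _ in range(nx)]
    steps = 0
    # Cells on the same anti-diagonal (i + j == d) do not depend on each
    # other, so each diagonal is one step that could run in parallel.
    for d in range(nx + ny - 1):
        for i in range(max(0, d - ny + 1), min(nx, d + 1)):
            j = d - i
            west = grid[i - 1][j] if i > 0 else boundary
            north = grid[i][j - 1] if j > 0 else boundary
            # Illustrative update rule: average of the upstream values.
            grid[i][j] = 0.5 * (west + north)
        steps += 1
    return grid, steps


if __name__ == "__main__":
    grid, steps = wavefront_sweep(4, 3)
    print("wave-front steps:", steps)
    for row in grid:
        print(row)
```

In this sketch the number of steps is nx + ny - 1, the number of anti-diagonals in the grid. The same ordering constraint applies between processors that exchange boundary data, which is why the abstract notes that the cost is typically impacted by the slowest communication channel in use.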

Publisher

Database: Elsevier - ScienceDirect
Journal: Parallel Computing - Volume 37, Issue 9, September 2011, Pages 550–561

Authors

Darren J. Kerbyson, Michael Lang, Scott Pakin
