کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
524057 868549 2013 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
All-pairs computations on many-core graphics processors
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
All-pairs computations on many-core graphics processors
چکیده انگلیسی

Developing high-performance applications on emerging multi- and many-core architectures requires efficient mapping techniques and architecture-specific tuning methodologies to realize performance closer to their peak compute capability and memory bandwidth. In this paper, we develop architecture-aware methods to accelerate all-pairs computations on many-core graphics processors. Pairwise computations occur frequently in numerous application areas in scientific computing. While they appear easy to parallelize due to the independence of computing each pairwise interaction from all others, development of techniques to address multi-layered memory hierarchies, mapping within the restrictions imposed by the small and low-latency on-chip memories, striking the right balanced between concurrency, reuse and memory traffic etc., are crucial to obtain high-performance. We present a hierarchical decomposition scheme for GPUs based on decomposition of the output matrix and input data. We demonstrate that a careful tuning of the involved set of decomposition parameters is essential to achieve high efficiency on the GPUs. We also compare the performance of our strategies with an implementation on the STI Cell processor as well as multi-core CPU parallelizations using OpenMP and Intel Threading Building Blocks.


► We develop architecture-aware high-performance methods to accelerate generalized all-pairs computations graphics processors.
► We perform in-depth analysis on how to carefully tune the parameters involved.
► We demonstrate the methods through applications in fluid dynamics and material sciences.
► We compare the performances on graphics processors, Cell processors, and multi-core CPUs.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 39, Issue 2, February 2013, Pages 79–93
نویسندگان
, ,