کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
432437 688890 2013 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Benchmarking of communication techniques for GPUs
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Benchmarking of communication techniques for GPUs
چکیده انگلیسی

We report about the performances obtained, at the application level, by two MPI implementations for Infiniband that allow direct exchange of data stored in the global memory of Graphic Processing Units (GPU) based on the Nvidia CUDA. For the same purpose, we tested also the Application Programming Interface of APEnet, which is a custom, high performance interconnect technology. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the over-relaxation algorithm. The results show that CUDA streams are instrumental in achieving the best possible performances.


► Perfect scaling is possible by using overlap between communication and computation.
► CUDA streams are crucial to achieve overlap between communication and computation.
► APEnet+ is a first example of interconnection technology that does not need a CPU.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 73, Issue 2, February 2013, Pages 250–255
نویسندگان
, , ,