کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
431787 688628 2013 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Towards accelerating smoothed particle hydrodynamics simulations for free-surface flows on multi-GPU clusters
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Towards accelerating smoothed particle hydrodynamics simulations for free-surface flows on multi-GPU clusters
چکیده انگلیسی

Starting from the single graphics processing unit (GPU) version of the Smoothed Particle Hydrodynamics (SPH) code DualSPHysics, a multi-GPU SPH program is developed for free-surface flows. The approach is based on a spatial decomposition technique, whereby different portions (sub-domains) of the physical system under study are assigned to different GPUs. Communication between devices is achieved with the use of Message Passing Interface (MPI) application programming interface (API) routines. The use of the sorting algorithm radix sort for inter-GPU particle migration and sub-domain “halo” building (which enables interaction between SPH particles of different sub-domains) is described in detail. With the resulting scheme it is possible, on the one hand, to carry out simulations that could also be performed on a single GPU, but they can now be performed even faster than on one of these devices alone. On the other hand, accelerated simulations can be performed with up to 32 million particles on the current architecture, which is beyond the limitations of a single GPU due to memory constraints. A study of weak and strong scaling behaviour, speedups and efficiency of the resulting program is presented including an investigation to elucidate the computational bottlenecks. Last, possibilities for reduction of the effects of overhead on computational efficiency in future versions of our scheme are discussed.


► A massively parallel approach to Smoothed Particle Hydrodynamics is presented.
► Multiple Graphics Processing Units (GPUs) are used.
► Up to 32 million particles are simulated at high speeds.
► Efficiency, speedup, and scaling behaviour of resulting program are studied.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 73, Issue 11, November 2013, Pages 1483–1493
نویسندگان
, , , ,