Performance modeling and analysis of heterogeneous lattice Boltzmann simulations on CPU

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
523874	868516	2015	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Performance modeling and analysis of heterogeneous lattice Boltzmann simulations on CPU–GPU clusters

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Lattice Boltzmann method - روش شبکه بولتزمن Performance modeling - مدل سازی عملکرد CUDA - کودا. پردازش موازی و مدل برنامه‌نویسی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش صفحه اول مقاله

Performance modeling and analysis of heterogeneous lattice Boltzmann simulations on CPU–GPU clusters

چکیده انگلیسی

• Introduction of software design concepts for multi-GPU simulations.
• A study on how to efficiently exploit heterogeneous compute nodes.
• A detailed analysis of the performance of parallel multi-GPU.
• Heterogeneous lattice Boltzmann simulations on Tsubame 2.0.
• A performance model for the communication overhead.

Computational fluid dynamic simulations are in general very compute intensive. Only by parallel simulations on modern supercomputers the computational demands of complex simulation tasks can be satisfied. Facing these computational demands GPUs offer high performance, as they provide the high floating point performance and memory to processor chip bandwidth. To successfully utilize GPU clusters for the daily business of a large community, usable software frameworks must be established on these clusters. The development of such software frameworks is only feasible with maintainable software designs that consider performance as a design objective right from the start. For this work we extend the software design concepts to achieve more efficient and highly scalable multi-GPU parallelization within our software framework waLBerla for multi-physics simulations centered around the lattice Boltzmann method. Our software designs now also support a pure-MPI and a hybrid parallelization approach capable of heterogeneous simulations using CPUs and GPUs in parallel. For the first time weak and strong scaling performance results obtained on the Tsubame 2.0 cluster for more than 1000 GPUs are presented using waLBerla. With the help of a new communication model the parallel efficiency of our implementation is investigated and analyzed in a detailed and structured performance analysis. The suitability of the waLBerla framework for production runs on large GPU clusters is demonstrated. As one possible application we show results of strong scaling experiments for flows through a porous medium.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 46, July 2015, Pages 1–13

نویسندگان

Christian Feichtinger, Johannes Habich, Harald Köstler, Ulrich Rüde, Takayuki Aoki,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Performance modeling and analysis of heterogeneous lattice Boltzmann simulations on CPU–GPU clusters

دسترسی سریع

ارتباط

English Website