کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
523777 868491 2015 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Improved strong scaling of a spectral/finite difference gyrokinetic code for multi-scale plasma turbulence
ترجمه فارسی عنوان
مقیاس قوی تقویت یک کد ژیروسکوپ طیفی / متناهی محدود برای آشفتگی پلاسمای چندگانه
کلمات کلیدی
ژیروکینتیکس، کاربرد دینامیک سیال محاسباتی مقیاس پذیری موازی، روش طیفی، تفاوت محدود
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Optimization techniques of a 5D plasma turbulence simulation code are presented.
• Segmented MPI-process mapping on 3D torus networks reduces communication costs.
• Pipelined computation-communication overlaps effectively mask communication costs.
• The code achieves strong excellent strong scaling up to 600k cores.
• Cross-scale interactions of multi-scale plasma turbulence is demonstrated.

Optimization techniques of a plasma turbulence simulation code GKV for improved strong scaling are presented. This work is motivated by multi-scale plasma turbulence extending over multiple spatio-temporal scales of electrons and ions, whose simulations based on the gyrokinetic theory require huge calculations of five-dimensional (5D) computational fluid dynamics by means of spectral and finite difference methods. First, we present the multi-layer domain decomposition of the multi-dimensional and multi-species problem, and segmented MPI-process mapping on 3D torus interconnects, which fully utilizes the bi-section bandwidth for data transpose and reduces the conflicts of simultaneous point-to-point communications. These techniques reduce the inter-node communication cost drastically. Second, pipelined computation-communication overlaps are implemented by using the OpenMP/MPI hybrid parallelization, which effectively mask the communication cost. Careful regulations of the pipeline length and of the thread granularity respectively suppress latencies of MPI, load imbalance and scheduling overheads of OpenMP. Thanks to the above optimizations, GKV achieves excellent strong scaling up to ∼600k cores with high computational performance 782.4 TFlops (8.29% of the theoretical peak) and high effective parallelization rate ∼99.99994% on K, which demonstrates its applicability and efficiency toward a million of cores. The optimized code realizes multi-scale plasma turbulence simulations covering electron and ion scales, and reveals cross-scale interactions of electron- and ion-scale turbulence.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 49, November 2015, Pages 1–12
نویسندگان
, , , , , ,