Article code: 6931671
Journal code: 867703
Publication year: 2015
English article: 18-page PDF
Full-text version: Free download
English title of the ISI article
Towards an ultra efficient kinetic scheme. Part III: High-performance-computing
Persian translation of the title
به سوی یک طرح سینتیکی فوق العاده کارآمد. قسمت سوم: محاسبات با کارایی بالا
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Applications
English abstract
In this paper we demonstrate the capability of the fast semi-Lagrangian scheme developed in [20] and [21] to handle parallel architectures. First, we present the behavior of this scheme on a classical architecture using OpenMP, and then on a GPU (Graphics Processing Unit) architecture using CUDA. The goal is to show that this new scheme is well suited to these types of parallelization and, moreover, that the gain in CPU time is substantial on today's affordable computers. We first present the sequential version of our high-order kinetic scheme and focus on the details that matter for an effective parallel implementation. Then, we introduce the specific treatments and algorithms developed for the OpenMP and CUDA parallelizations. Numerical tests are shown for full 3D/3D simulations. These demonstrate the substantial speed-up of the parallel versions over the sequential code and the very good scalability of the method, which make this approach a real competitor to existing schemes for the solution of multidimensional kinetic models.
Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Computational Physics - Volume 284, 1 March 2015, Pages 22-39