کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
429359 687477 2011 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Efficient multicore-aware parallelization strategies for iterative stencil computations
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Efficient multicore-aware parallelization strategies for iterative stencil computations
چکیده انگلیسی

Stencil computations consume a major part of runtime in many scientific simulation codes. As prototypes for this class of algorithms we consider the iterative Jacobi and Gauss-Seidel smoothers and aim at highly efficient parallel implementations for cache-based multicore architectures. Temporal cache blocking is a known advanced optimization technique, which can reduce the pressure on the memory bus significantly. We apply and refine this optimization for a recently presented temporal blocking strategy designed to explicitly utilize multicore characteristics. Especially for the case of Gauss-Seidel smoothers we show that simultaneous multi-threading (SMT) can yield substantial performance improvements for our optimized algorithm on some architectures.

Research highlights
► Wavefront/pipelined parallel temporal blocking optimization.
► Shown for Gauss-seidel and Jacobi stencils.
► Explicitly uses shared caches on recent multicore processors.
► Significant speedups versus optimal non-blocked code observed.
► Effect of simultaneous multithreading (SMT) analyzed.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Computational Science - Volume 2, Issue 2, May 2011, Pages 130–137
نویسندگان
, , ,