کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
523932 868529 2014 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A study of shared-memory parallelism in a multifrontal solver
ترجمه فارسی عنوان
مطالعهی همپوشانی حافظه مشترک در یک پردازنده چند منظوره
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• We study shared-memory parallelism in a multifrontal solver.
• We combine the use of OpenMP directives, multithreaded libraries, and MPI.
• We revisit and improve upon the Geist–Ng algorithm on multicore systems relying on a performance model.
• We propose and analyze simple approaches to take advantage of NUMA architectures.
• We briefly present an alternative implementation of work-stealing to limit synchronization costs.

We introduce shared-memory parallelism in a parallel distributed-memory solver, targeting multi-core architectures. Our concern in this paper is pure shared-memory parallelism, although the work will also impact distributed-memory parallelism. Our approach avoids a deep redesign and fully benefits from the numerical kernels and features of the original code. We use performance models to exploit coarse-grain parallelism in an OpenMP environment while, at the same time, also relying on third-party optimized multithreaded libraries. In this context, we propose simple approaches to take advantage of NUMA architectures, and original optimizations to limit thread synchronization costs. The performance gains are analyzed in detail on test problems from various application areas. Although the studied code is a direct solver for sparse systems of linear equations, the contributions of this paper are more general and could be useful in a wider range of situations.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 40, Issues 3–4, March 2014, Pages 34–46
نویسندگان
, ,