کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6935330 868794 2014 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Adaptive block size for dense QR factorization in hybrid CPU-GPU systems via statistical modeling
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Adaptive block size for dense QR factorization in hybrid CPU-GPU systems via statistical modeling
چکیده انگلیسی
QR factorization is a computational kernel of scientific computing. How can the latest computer be used to accelerate this task? We investigate this topic by proposing a dense QR factorization algorithm with adaptive block sizes on a hybrid system that contains a central processing unit (CPU) and a graphic processing unit (GPU). To maximize the use of CPU and GPU, we develop an adaptive scheme that chooses block size at each iteration. The decision is based on statistical surrogate models of performance and an online monitor, which avoids unexpected occasional performance drops. We modify the highly optimized CPU-GPU based QR factorization in MAGMA to implement the proposed schemes. Numerical results suggest that our approaches are efficient and can lead to near-optimal block sizes. The proposed algorithm can be extended to other one-sided factorizations, such as LU and Cholesky factorizations.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 40, Issues 5–6, May 2014, Pages 70-85
نویسندگان
, , ,