کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
431533 | 688570 | 2012 | 8 صفحه PDF | دانلود رایگان |
In this work we describe some parallel algorithms for solving nonlinear systems using CUDA (Compute Unified Device Architecture) over a GPU (Graphics Processing Unit). The proposed algorithms are based on both the Fletcher–Reeves version of the nonlinear conjugate gradient method and a polynomial preconditioner type based on block two-stage methods. Several strategies of parallelization and different storage formats for sparse matrices are discussed. The reported numerical experiments analyze the behavior of these algorithms working in a fine grain parallel environment compared with a thread-based environment.
► We propose parallel algorithms for nonlinear systems using CUDA over a GPU.
► Several strategies of parallelization and storage formats are discussed.
► A multicore OpenMP model, a GPGPU model and a mixed model are analyzed.
► The use of CUBLAS and CUSPARSE libraries offer a good performance.
► The use of GPU improves the results obtained using any of the proposed methods.
Journal: Journal of Parallel and Distributed Computing - Volume 72, Issue 9, September 2012, Pages 1098–1105