Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4960992 | Procedia Computer Science | 2017 | 10 Pages |
Abstract
We present a multithreaded method for supernodal sparse Cholesky factorization on a hybrid multicore platform consisting of a multicore CPU and a GPU. Our algorithm exploits concurrency at different levels of the elimination tree by using multiple threads on both the CPU and the GPU. The elimination tree is a tree data structure that describes the workflow of the factorization. Our experimental results on a platform consisting of an Intel multicore processor and an Nvidia GPU indicate a significant improvement in performance and energy consumption over a single-threaded supernodal algorithm.
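To make the role of the elimination tree concrete, the following is a minimal sketch, not the authors' implementation. It builds the elimination tree of a small symmetric sparsity pattern with the classic ancestor-compression approach and then groups columns by height in the tree. Columns at the same height are never ancestors of one another, so they are candidates for concurrent processing once their children are finished. The function names, the `lower_rows` input format, and the example pattern are assumptions made for illustration only, and the sketch works on individual columns rather than supernodes.

```python
from typing import List


def elimination_tree(lower_rows: List[List[int]]) -> List[int]:
    """Return parent[] of the elimination tree (-1 marks a root).

    lower_rows[k] lists the column indices j < k where the symmetric
    matrix has a nonzero entry A[k][j] (hypothetical input format).
    """
    n = len(lower_rows)
    parent = [-1] * n
    ancestor = [-1] * n  # path-compressed ancestor links
    for k in range(n):
        for i in lower_rows[k]:
            # Walk from i toward the current root, redirecting links to k.
            while i != -1 and i < k:
                next_i = ancestor[i]
                ancestor[i] = k
                if next_i == -1:
                    parent[i] = k  # i had no parent yet, so k becomes it
                i = next_i
    return parent


def levels_by_height(parent: List[int]) -> List[List[int]]:
    """Group columns by height above the leaves (leaves have height 0)."""
    n = len(parent)
    height = [0] * n
    # parent[i] > i always holds, so children are finalized before parents.
    for i in range(n):
        p = parent[i]
        if p != -1:
            height[p] = max(height[p], height[i] + 1)
    levels: List[List[int]] = [[] for _ in range(max(height, default=-1) + 1)]
    for i in range(n):
        levels[height[i]].append(i)
    return levels


# Illustrative 5x5 pattern: row 2 couples to columns 0 and 1,
# row 4 couples to columns 2 and 3.
example_lower_rows = [[], [], [0, 1], [], [2, 3]]
example_parent = elimination_tree(example_lower_rows)   # [2, 2, 4, 4, -1]
example_levels = levels_by_height(example_parent)       # [[0, 1, 3], [2], [4]]
```

In this example, columns 0, 1, and 3 form the lowest level and could be factored independently, column 2 must wait for columns 0 and 1, and column 4 is the root. A scheduler for a hybrid platform would additionally have to decide which of these units go to CPU threads and which go to the GPU, which this sketch does not attempt.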
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)
Authors
Meng Tang, Mohamed Gadou, Sanjay Ranka