Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4962803 | Sustainable Computing: Informatics and Systems | 2016 | 11 Pages |
Abstract
We have implemented our CG solver using Altera's OpenCL SDK for FPGAs and use NVIDIA's CUBLAS library for the forward step on the GPU. Through the combination of GPU and FPGA we were able to achieve a speedup of 3.7Ã for large dense 24,064Â ÃÂ 24,064 matrices and require 3.5Ã less energy per solved right-hand side compared to a tuned multi-threaded CPU solver based on the ATLAS linear algebra library.
Keywords
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)
Authors
C.M. Angerer, R. Polig, D. Zegarac, H. Giefers, C. Hagleitner, C. Bekas, A. Curioni,