Article ID Journal Published Year Pages File Type
4962803 Sustainable Computing: Informatics and Systems 2016 11 Pages PDF
Abstract
We have implemented our CG solver using Altera's OpenCL SDK for FPGAs and use NVIDIA's CUBLAS library for the forward step on the GPU. Through the combination of GPU and FPGA we were able to achieve a speedup of 3.7× for large dense 24,064 × 24,064 matrices and require 3.5× less energy per solved right-hand side compared to a tuned multi-threaded CPU solver based on the ATLAS linear algebra library.
Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)
Authors
, , , , , , ,