Article ID | Journal | Published Year | Pages |
---|---|---|---|
10340999 | Computers & Electrical Engineering | 2014 | 15 Pages |
Abstract
In heterogeneous computing, application developers have to identify the best-suited target platform from a variety of alternatives. In this work, we compare the performance and architectural efficiency of Graphics Processing Units (GPUs) and Field Programmable Gate Arrays (FPGAs) for two algorithms taken from a novel medical imaging method named 3D ultrasound computer tomography. From the 40 nm and 28 nm generations, we use both top-of-the-line devices and devices with similar power consumption. For our two benchmark algorithms from the signal processing and imaging domain, the results show that, if power consumption is not considered, the GPU and FPGA from the 40 nm generation deliver similar performance and similar efficiency per transistor. In the 28 nm process, in contrast, the FPGA is superior to its GPU counterpart by 86% and 39%, depending on the algorithm. If power is limited, FPGAs outperform GPUs in each investigated case by at least a factor of four.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Networks and Communications
Authors
Matthias Birk, Matthias Balzer, Nicole V. Ruiter, Juergen Becker