Article ID | Journal ID | Publication year | English article | Full-text version |
---|---|---|---|---|
501967 | 863673 | 2013 | 14-page PDF | Free download |
Roe’s flux-difference splitting scheme has been implemented using the NVIDIA CUDA architecture and applied to the solution of the two-dimensional compressible Euler equations. Several standard test cases have been considered in order to estimate the speed-up of GPU computing with respect to CPU calculation. A detailed description of the kernel configuration is provided, together with a theoretical analysis of the GPU execution time as a function of the number of threads managed by the kernels. The loss of performance resulting from the use of zero-copy memory is fully described. Significant performance improvements have been obtained by using a more recent GPU and CUDA Toolkit. A multi-GPU test case based on a domain decomposition approach is also presented.
Journal: Computer Physics Communications - Volume 184, Issue 6, June 2013, Pages 1497–1510
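
As a rough illustration of the Roe flux-difference splitting named in the abstract, the following is a minimal sketch in plain Python of the interface flux for the one-dimensional Euler equations. It is not the paper's CUDA code: the reduction to one dimension, the ideal-gas value `GAMMA = 1.4`, the absence of an entropy fix, and all function names (`conserved`, `primitive`, `physical_flux`, `roe_flux`) are assumptions made for this example.

```python
import math

GAMMA = 1.4  # assumed ratio of specific heats (ideal diatomic gas)


def conserved(rho, u, p):
    """Build the conserved state (rho, rho*u, E) from primitive variables."""
    return (rho, rho * u, p / (GAMMA - 1.0) + 0.5 * rho * u * u)


def primitive(U):
    """Recover (rho, u, p) from the conserved state."""
    rho, mom, E = U
    u = mom / rho
    p = (GAMMA - 1.0) * (E - 0.5 * rho * u * u)
    return rho, u, p


def physical_flux(U):
    """Exact Euler flux F(U) = (rho*u, rho*u^2 + p, u*(E + p))."""
    rho, u, p = primitive(U)
    return (rho * u, rho * u * u + p, u * (U[2] + p))


def roe_flux(UL, UR):
    """Roe numerical flux at an interface between states UL and UR.

    Hypothetical helper for illustration; no entropy fix is applied.
    """
    rl, ul, pl = primitive(UL)
    rr, ur, pr = primitive(UR)
    HL = (UL[2] + pl) / rl  # total specific enthalpy, left
    HR = (UR[2] + pr) / rr  # total specific enthalpy, right

    # Roe (square-root-of-density weighted) averages.
    wl, wr = math.sqrt(rl), math.sqrt(rr)
    u = (wl * ul + wr * ur) / (wl + wr)
    H = (wl * HL + wr * HR) / (wl + wr)
    a = math.sqrt((GAMMA - 1.0) * (H - 0.5 * u * u))

    # Jumps in the conserved variables.
    d1, d2, d3 = (UR[k] - UL[k] for k in range(3))

    # Wave strengths: projection of the jump onto the right eigenvectors.
    alpha2 = (GAMMA - 1.0) / (a * a) * (d1 * (H - u * u) + u * d2 - d3)
    alpha1 = (d1 * (u + a) - d2 - a * alpha2) / (2.0 * a)
    alpha3 = d1 - alpha1 - alpha2

    # (strength, eigenvalue, right eigenvector) for each of the three waves.
    waves = [
        (alpha1, u - a, (1.0, u - a, H - u * a)),
        (alpha2, u, (1.0, u, 0.5 * u * u)),
        (alpha3, u + a, (1.0, u + a, H + u * a)),
    ]

    FL, FR = physical_flux(UL), physical_flux(UR)
    # Central average minus the upwind dissipation term.
    return tuple(
        0.5 * (FL[k] + FR[k])
        - 0.5 * sum(al * abs(lam) * K[k] for al, lam, K in waves)
        for k in range(3)
    )
```

Two properties make quick sanity checks: for identical left and right states the result should reduce to the physical flux, and for flow that is supersonic to the right it should reduce to the left-state flux. In a GPU setting such as the one the abstract describes, a routine of this kind would presumably be evaluated once per cell interface by each thread; that mapping is not shown here.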