| کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن | 
|---|---|---|---|---|
| 768147 | 1462709 | 2014 | 10 صفحه PDF | دانلود رایگان | 
• The Correction Procedure via Reconstruction method is efficient for GPU CUDA computing.
• Up to seventy times speed-up is demonstrated when comparing the GPU CPR code to a CPU CPR code.
• High-order methods can benefit greatly from GPU CUDA computing.
Computational fluid dynamics (CFD) has long been a useful tool to model fluid flow problems across many engineering disciplines, and while problem size, complexity, and difficulty continue to expand, the demands for robustness and accuracy grow. Furthermore, generating high-order accurate solutions has escalated the required computational resources, and as problems continue to increase in complexity, so will computational needs such as memory requirements and calculation time for accurate flow field prediction. To improve upon computational time, vast amounts of computational power and resources are employed, but even over dozens to hundreds of central processing units (CPUs), the required computational time to compute solutions can be weeks, months, or longer, which is particularly true when generating high-order accurate solutions over large computational domains. One response to lower the computational time for CFD problems is to implement current CFD solvers on graphical processing units (GPUs). GPUs have illustrated the ability to solve problems orders of magnitude faster than their CPU counterparts with identical accuracy. The goal of the presented work is to combine a CFD solver and GPU computing with the intent to solve complex problems at a high-order of accuracy while lowering the computational time required to generate the solution. The CFD solver should have high-order spacial capabilities to evaluate small fluctuations and fluid structures not generally captured by lower-order methods (2nd and 1st order) and be efficient for the GPU architecture. This research combines the high-order Correction Procedure via Reconstruction (CPR) method for explicit time-stepping with compute unified device architecture (CUDA) from NVIDIA to reach these goals. In addition, the study demonstrates accuracy of the developed solver by comparing results with other solvers and exact solutions. Finally, to illustrate speed-ups, a single core Intel Xeon X5650 at 2.67 GHz is compared against a Tesla C2070 and a K20x, where a maximum speed-up of nearly 70x is demonstrated.
Journal: Computers & Fluids - Volume 101, 20 September 2014, Pages 263–272
