Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10356340 | Journal of Computational Physics | 2011 | 10 Pages |
Abstract
We present an implementation of a 2D fully relativistic, electromagnetic particle-in-cell code, with charge-conserving current deposition, on parallel graphics processors (GPU) with CUDA. The GPU implementation achieved a one particle-step process time of 2.52Â ns for cold plasma runs and 9.15Â ns for extremely relativistic plasma runs, which are respectively 81 and 27 times faster than a single threaded state-of-art CPU code. A particle-based computation thread assignment was used in the current deposition scheme and write conflicts among the threads were resolved by a thread racing technique. A parallel particle sorting scheme was also developed and used. The implementation took advantage of fast on-chip shared memory, and can in principle be extended to 3D.
Keywords
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Xianglong Kong, Michael C. Huang, Chuang Ren, Viktor K. Decyk,