کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
429584 687606 2012 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Implementation of a parallel tree method on a GPU
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Implementation of a parallel tree method on a GPU
چکیده انگلیسی

The kd-tree is a fundamental tool in computer science. Among other applications, the application of kd-tree search (by the tree method) to the fast evaluation of particle interactions and neighbor search is highly important, since the computational complexity of these problems is reduced from O(N2) for a brute force method to O(N log  N) for the tree method, where N is the number of particles. In this paper, we present a parallel implementation of the tree method running on a graphics processing unit (GPU). We present a detailed description of how we have implemented the tree method on a Cypress GPU. An optimization that we found important is localized particle ordering to effectively utilize cache memory. We present a number of test results and performance measurements. Our results show that the execution of the tree traversal in a force calculation on a GPU is practical and efficient.


► We have implemented a parallel tree method on a GPU.
► The localized particle ordering is an important optimization on the GPU.
► Our method on the GPU is faster than a brute-force method on the GPU.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Computational Science - Volume 3, Issue 3, May 2012, Pages 132–141
نویسندگان
,