Article code: 512832
Journal code: 866432
Publication year: 2012
English article: 10 pages (PDF)
Full-text version: free download
English title of the ISI article
On the effective implementation of a boundary element code on graphics processing units using an out-of-core LU algorithm
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Software
Abstract

A collocation boundary element code for solving the three-dimensional Laplace equation, publicly available from http://intetec.org, has been adapted to run on an Nvidia Tesla general-purpose graphics processing unit (GPU). Global matrix assembly and LU factorization of the resulting dense matrix are performed on the GPU. Out-of-core techniques are used to solve problems larger than the available GPU memory. The code achieved about 10 times speedup in matrix assembly over a single CPU core and about 56 Gflops/s in the LU factorization using only 512 Mbytes of GPU memory. Details of the GPU implementation and comparisons with the standard sequential algorithm are included to illustrate the performance of the GPU code.
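
The out-of-core idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say which LU variant or pivoting strategy the code uses, so the example below assumes a generic left-looking scheme without pivoting, runs on the CPU only, and uses a small Python list as a stand-in for the limited GPU memory. The names `out_of_core_lu`, `multiply_lu`, and `panel_width` are hypothetical.

```python
def out_of_core_lu(matrix, panel_width):
    """Unpivoted LU factorization processed one column panel at a time.

    Returns a new matrix holding U on and above the diagonal and the
    multipliers of the unit lower triangular L below it.
    """
    a = [list(row) for row in matrix]  # "host" copy of the full matrix
    n = len(a)
    for start in range(0, n, panel_width):
        stop = min(start + panel_width, n)
        width = stop - start
        # Load one panel into a small buffer (stand-in for device memory).
        panel = [row[start:stop] for row in a]

        # Left-looking update: apply every previously factored column k.
        # Reading a[i][k] models streaming old L columns from the host.
        for k in range(start):
            for j in range(width):
                u_kj = panel[k][j]  # final U entry once columns < k are applied
                for i in range(k + 1, n):
                    panel[i][j] -= a[i][k] * u_kj

        # Factor the panel itself (assumption: no pivoting, so a zero
        # pivot raises ZeroDivisionError).
        for k in range(start, stop):
            c = k - start
            pivot = panel[k][c]
            for i in range(k + 1, n):
                panel[i][c] /= pivot
                for j in range(c + 1, width):
                    panel[i][j] -= panel[i][c] * panel[k][j]

        # Write the finished panel back to the host matrix.
        for i in range(n):
            a[i][start:stop] = panel[i]
    return a


def multiply_lu(lu):
    """Rebuild the product L * U from the packed factors, as a check."""
    n = len(lu)
    product = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(min(i, j) + 1):
                l_ik = 1.0 if k == i else lu[i][k]
                product[i][j] += l_ik * lu[k][j]
    return product
```

Calling `out_of_core_lu(a, panel_width=2)` on a small well-conditioned matrix and passing the result to `multiply_lu` should reproduce `a`. The working buffer never holds more than `panel_width` columns at once, which is the property that lets a scheme of this kind handle matrices larger than device memory. A production code would add pivoting and would stream the previously factored columns in blocks rather than element by element.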

Publisher
Database: Elsevier - ScienceDirect
Journal: Engineering Analysis with Boundary Elements - Volume 36, Issue 8, August 2012, Pages 1246–1255