کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1779448 1022001 2006 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Performance tuning of N-body codes on modern microprocessors: I. Direct integration with a hermite scheme on x86_64 architecture
موضوعات مرتبط
مهندسی و علوم پایه فیزیک و نجوم نجوم و فیزیک نجومی
پیش نمایش صفحه اول مقاله
Performance tuning of N-body codes on modern microprocessors: I. Direct integration with a hermite scheme on x86_64 architecture
چکیده انگلیسی

The main performance bottleneck of gravitational N-body codes is the force calculation between two particles. We have succeeded in speeding up this pair-wise force calculation by factors between 2 and 10, depending on the code and the processor on which the code is run. These speed-ups were obtained by writing highly fine-tuned code for x86_64 microprocessors. Any existing N-body code, running on these chips, can easily incorporate our assembly code programs.In the current paper, we present an outline of our overall approach, which we illustrate with one specific example: the use of a Hermite scheme for a direct N2 type integration on a single 2.0 GHz Athlon 64 processor, for which we obtain an effective performance of 4.05 Gflops, for double-precision accuracy. In subsequent papers, we will discuss other variations, including the combinations of N log N codes, single-precision implementations, and performance on other microprocessors.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: New Astronomy - Volume 12, Issue 3, December 2006, Pages 169–181
نویسندگان
, , ,