Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
486701 | Procedia Computer Science | 2012 | 10 Pages |
Abstract
In a complex processor landscape dominated by multi-and many-core processors, simplifying programming plays a crucial role in enhancing developers’ productivity. One way is to use highly tuned library functions. In this paper we present fastsg, an optimized library for the sparse grid technique with support for dimensional truncation. With optimizations for best cache use and vectorization, we improve the performance on one processor core up to a factor of 10. Parallelization using OpenMP scales almost linearly on a 12-core system.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)