Article ID Journal Published Year Pages File Type
6928504 Journal of Computational Physics 2018 25 Pages PDF
Abstract
The optimization of an elastodynamics simulation code for the KNL Many Integrated Core processor was performed. The optimization focused on data locality and vectorization. Results show that tiling of the data to exploit the cache behavior and allow for significant utilization of the KNL hardware. The MPI implementation allows for a scalable implementation enabling large problems to be simulated. The model results were validated against theoretical dispersion curves to within 2% of the group velocity, and within 0.5% of the phase velocity of the A0 mode. Aggressive use of tiling, threading, and vectorization techniques allowed for dramatically improved time to solution.
Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
, , ,