Article ID Journal Published Year Pages File Type
488152 Procedia Computer Science 2011 9 Pages PDF
Abstract

We propose an optimized implementation of the MERAM method and preliminary experiments to solve non-Hermitian eigenproblems faster using this asynchronous hybrid method. We focus on improving the communication pattern by providing an entity called the collector as well as an optimized communication scheme using MPI-2 one-sided communications. The scalability of the parallelization is discussed and experiments are done to show how well our implementation scales to a large number of nodes. The one 8 cores node computing time of 3800 seconds is reduced to 4 seconds using 1200 Nehalem cores, and we achieve linear to superlinear speed-ups thanks to our efficient communication pattern and the coarse-grained parallel nature of MERAM. Our approach achieved an optimal performance on more than a thousand cores for the first time with MERAM. We conclude that hybrid asynchronous methods like MERAM with a good communication patter offer tremendous possibilities for high performance computing.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)