Article ID: 756588
Journal: Computers & Fluids
Published Year: 2013
Pages: 6
File Type: PDF
Abstract

In a hierarchical parallel environment with multicore processors, the mapping of subdomains to CPUs/cores was optimized by considering both the communication speed of the different communication paths and the communication pattern of a parallel application based on the domain decomposition method. We evaluated the proposed method on a massively parallel Intel Xeon PC cluster and confirmed, in several benchmark tests, that it reduced communication time and achieved higher parallel performance than runs without the mapping.
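
The idea of weighting an application's communication pattern by the speed of each communication path can be illustrated with a small sketch. This is not the authors' algorithm: the two-level cost model (`intra_cost`, `inter_cost`), the `traffic` dictionary, and all function names are hypothetical, and an exhaustive search stands in for whatever optimizer the paper actually uses.

```python
from itertools import permutations

def path_cost(core_a, core_b, cores_per_node, intra_cost=1.0, inter_cost=10.0):
    """Relative cost per unit of data between two cores (assumed values)."""
    if core_a == core_b:
        return 0.0
    same_node = core_a // cores_per_node == core_b // cores_per_node
    return intra_cost if same_node else inter_cost

def mapping_cost(mapping, traffic, cores_per_node):
    """Total weighted cost; mapping[i] is the core assigned to subdomain i."""
    return sum(
        volume * path_cost(mapping[i], mapping[j], cores_per_node)
        for (i, j), volume in traffic.items()
    )

def best_mapping(n_subdomains, traffic, cores_per_node):
    """Exhaustive search over one-to-one mappings (feasible only for tiny cases)."""
    return min(
        permutations(range(n_subdomains)),
        key=lambda m: mapping_cost(m, traffic, cores_per_node),
    )

# Hypothetical case: 4 subdomains on 2 nodes with 2 cores each.
# Subdomain pairs (0, 2) and (1, 3) exchange the most data.
traffic = {(0, 2): 5.0, (1, 3): 5.0, (0, 1): 1.0, (2, 3): 1.0}
identity = (0, 1, 2, 3)
optimized = best_mapping(4, traffic, cores_per_node=2)

print(mapping_cost(identity, traffic, 2))   # cost without mapping
print(mapping_cost(optimized, traffic, 2))  # cost with the optimized mapping
```

In this toy case the optimized mapping places the heavily communicating subdomain pairs on the same node, so most of the traffic travels over the faster intra-node path. For realistic core counts the search space grows factorially, so a heuristic would be needed in place of the exhaustive search.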
