کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
432707 689043 2014 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Highly scalable computational algorithms on emerging parallel machine multicore architectures II: Development and implementation in the CSD and FSI contexts
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Highly scalable computational algorithms on emerging parallel machine multicore architectures II: Development and implementation in the CSD and FSI contexts
چکیده انگلیسی


• Extend highly scalable algorithms for CSD and FSI modules on emerging parallel HPC architectures.
• Develop a new solver specific multicore-FSI optimal partitioning to improve the FSI scalability.
• The intelligent partitioning optimizes the number of solid domains relative to number of fluid domains.
• Demonstrate good linear (weak) scalability for aeroelastic applications (O(1e6) cells, O(1e3) cores).
• Demonstrate significant scalability improvement for medium sized FSI problems using new ideas.

In this paper, the second in a series, the authors have extended and implemented their computational algorithms for improving the scalability of CSD (Computational Structural Dynamics) and FSI (Fluid–Structure Interaction) simulations on emerging architectures like multicore High Performance Computing (HPC) platforms. These algorithmic developments and extensions are classified into two categories: (i) enhanced scalability for CSD simulations on multicore platforms, (ii) newer ideas for running FSI simulations. In the first category, the authors employed the ideas developed in the first paper of this series including the multilevel partitioning strategy, next generation optimized communication procedure and better memory management to get enhanced scalability for CSD simulations. In the second category, the authors came up with a novel solver specific multicore-FSI optimal partitioning so as to improve the overall FSI scalability. After implementing the new “intelligent partitioning” algorithm, a speedup ratio of nearly 2.5x was obtained for the total time. The intelligent partitioning algorithm optimizes the number of solid domains relative to the number of fluid domains to optimize the overall FSI solution, irrespective of the type of the flow solver. In general, the authors have demonstrated (i) good, almost linear scalability for aeroelastic applications with several millions of cells on multicore platforms with thousands of cores, (ii) significant improvement in the scalability for smaller FSI problems using the intelligent partitioning.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 74, Issue 9, September 2014, Pages 2808–2817
نویسندگان
, , , , ,