کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
432664 689021 2015 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Intrinsic fault tolerance of multilevel Monte Carlo methods
ترجمه فارسی عنوان
تحمل گسل درونی چند سطحی از روش های مونت کارلو
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی


• Fault tolerant Monte Carlo/fault tolerant multilevel Monte Carlo.
• Intrinisic recilience to hardware failure at runtime.
• Exascale parallel computing.

Monte Carlo (MC) and multilevel Monte Carlo (MLMC) methods applied to solvers for Partial Differential Equations with random input data are proved to exhibit intrinsic failure resilience. Sufficient conditions are provided for non-recoverable loss of a random fraction of MC samples not to fatally damage the asymptotic accuracy versus work of a MC simulation. Specifically, the convergence behavior of MLMC methods on massively parallel hardware with runtime faults is analyzed mathematically and investigated computationally. Our mathematical model assumes node failures which occur uncorrelated of MC sampling and with general sample failure statistics on the different levels and which also assume absence of checkpointing, i.e., we assume irrecoverable sample failures with complete loss of data. Modifications of the MLMC with enhanced resilience are proposed. The theoretical results are obtained under general statistical models of CPU failure at runtime. Particular attention is paid to node failures with so-called Weibull failure models on massively parallel stochastic finite volume computational fluid dynamics simulations are discussed. We discuss the resilience of massively parallel stochastic Finite Volume computational fluid dynamics simulations.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 84, October 2015, Pages 24–36
نویسندگان
, , ,