Numerical analysis of continuous time Markov decision processes over finite horizons

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
475740	699368	2011	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Finite horizon - افق نهایی Error bounds - خطاهای خطا Uniformization - یکپارچه سازی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش صفحه اول مقاله

Numerical analysis of continuous time Markov decision processes over finite horizons

چکیده انگلیسی

Continuous time Markov decision processes (CTMDPs) with a finite state and action space have been considered for a long time. It is known that under fairly general conditions the reward gained over a finite horizon can be maximized by a so-called piecewise constant policy which changes only finitely often in a finite interval. Although this result is available for more than 30 years, numerical analysis approaches to compute the optimal policy and reward are restricted to discretization methods which are known to converge to the true solution if the discretization step goes to zero. In this paper, we present a new method that is based on uniformization of the CTMDP and allows one to compute an ε-optimalε-optimal policy up to a predefined precision in a numerically stable way using adaptive time steps.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Operations Research - Volume 38, Issue 3, March 2011, Pages 651–659

نویسندگان

Peter Buchholz, Ingo Schulz,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Numerical analysis of continuous time Markov decision processes over finite horizons

دسترسی سریع

ارتباط

English Website