Sample path optimality for a Markov optimization problem

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
10527372	958840	2005	11 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

28D20 58F11 58F12 Markov decision process - روند تصمیم گیری مارکوف Stochastic control - کنترل تصادفی

موضوعات مرتبط

مهندسی و علوم پایه ریاضیات ریاضیات (عمومی)

پیش نمایش صفحه اول مقاله

Sample path optimality for a Markov optimization problem

چکیده انگلیسی

We study a unichain Markov decision process i.e. a controlled Markov process whose state process under a stationary policy is an ergodic Markov chain. Here the state and action spaces are assumed to be either finite or countable. When the state process is uniformly ergodic and the immediate cost is bounded then a policy that minimizes the long-term expected average cost also has an nth stage sample path cost that with probability one is asymptotically less than the nth stage sample path cost under any other non-optimal stationary policy with a larger expected average cost. This is a strengthening in the Markov model case of the a.s. asymptotically optimal property frequently discussed in the literature.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Stochastic Processes and their Applications - Volume 115, Issue 5, May 2005, Pages 769-779

نویسندگان

F.Y. Hunt,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Sample path optimality for a Markov optimization problem

دسترسی سریع

ارتباط

English Website