Multi-policy improvement in stochastic optimization with forward recursive function criteria

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
9503149	1339557	2005	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Invariant imbedding - تجمع غیرقابل پیش بینی

موضوعات مرتبط

مهندسی و علوم پایه ریاضیات آنالیز ریاضی

پیش نمایش صفحه اول مقاله

Multi-policy improvement in stochastic optimization with forward recursive function criteria

چکیده انگلیسی

Iwamoto recently established a formal transformation via an invariant imbedding to construct a controlled Markov chain that can be solved in a backward manner, as in backward induction for finite-horizon Markov decision processes (MDPs), for a given controlled Markov chain with non-additive forward recursive objective function criterion. Chang et al. presented formal methods, called “parallel rollout” and “policy switching,” of combining given multiple policies in MDPs and showed that the policies generated by both methods improve all of the policies that the methods combine. This brief paper extends the methods of parallel rollout and policy switching for forward recursive objective function criteria and shows that the similar property holds as in MDPs. We further discuss how to implement these methods via simulation.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Mathematical Analysis and Applications - Volume 305, Issue 1, 1 May 2005, Pages 130-139

نویسندگان

Hyeong Soo Chang,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Multi-policy improvement in stochastic optimization with forward recursive function criteria

دسترسی سریع

ارتباط

English Website