Maximizing the set of recurrent states of an MDP subject to convex constraints

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
695511	890305	2014	5 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Convex optimization - بهینه سازی محدب Maximum Entropy - حداکثر آنتروپی Markov models - مدل مارکف Markov decision problems - مشکلات تصمیم مارکوف Optimal control - کنترل بهینه

موضوعات مرتبط

مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی

پیش نمایش صفحه اول مقاله

Maximizing the set of recurrent states of an MDP subject to convex constraints

چکیده انگلیسی

This paper focuses on the design of time-homogeneous fully observed Markov decision processes (MDPs), with finite state and action spaces. The main objective is to obtain policies that generate the maximal set of recurrent states, subject to convex constraints on the set of invariant probability mass functions. We propose a design method that relies on a finitely parametrized convex program inspired on principles of entropy maximization. A numerical example is provided to illustrate these ideas.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Automatica - Volume 50, Issue 3, March 2014, Pages 994–998

نویسندگان

Eduardo Arvelo, Nuno C. Martins,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Maximizing the set of recurrent states of an MDP subject to convex constraints

دسترسی سریع

ارتباط

English Website