An actor-critic algorithm for constrained Markov decision processes

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
10413114	895495	2005	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Stochastic approximation - تقریبی تصادفی Envelope theorem - قضیه پاکت Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی

پیش نمایش صفحه اول مقاله

An actor-critic algorithm for constrained Markov decision processes

چکیده انگلیسی

An actor-critic type reinforcement learning algorithm is proposed and analyzed for constrained controlled Markov decision processes. The analysis uses multiscale stochastic approximation theory and the `envelope theorem' of mathematical economics.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Systems & Control Letters - Volume 54, Issue 3, March 2005, Pages 207-213

نویسندگان

V.S. Borkar,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

An actor-critic algorithm for constrained Markov decision processes

دسترسی سریع

ارتباط

English Website