کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
698453 890408 2006 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Actor-critic algorithms for hierarchical Markov decision processes
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی
پیش نمایش صفحه اول مقاله
Actor-critic algorithms for hierarchical Markov decision processes
چکیده انگلیسی

We consider the problem of control of hierarchical Markov decision processes and develop a simulation based two-timescale actor-critic algorithm in a general framework. We also develop certain approximation algorithms that require less computation and satisfy a performance bound. One of the approximation algorithms is a three-timescale actor-critic algorithm while the other is a two-timescale algorithm, however, which operates in two separate stages. All our algorithms recursively update randomized policies using the simultaneous perturbation stochastic approximation (SPSA) methodology. We briefly present the convergence analysis of our algorithms. We then present numerical experiments on a problem of production planning in semiconductor fabs on which we compare the performance of all algorithms together with policy iteration. Algorithms based on certain Hadamard matrix based deterministic perturbations are found to show the best results.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Automatica - Volume 42, Issue 4, April 2006, Pages 637–644
نویسندگان
, ,