کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
710367 892109 2016 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Dual MPC with Reinforcement Learning
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مکانیک محاسباتی
پیش نمایش صفحه اول مقاله
Dual MPC with Reinforcement Learning
چکیده انگلیسی

An adaptive optimal control algorithm for systems with uncertain dynamics is formulated under a Reinforcement Learning framework. An embedded exploratory component is included explicitly in the objective function of an output feedback receding horizon Model Predictive Control problem. The optimization is formulated as a Quadratically Constrained Quadratic Program and it is solved to e-global optimality. The iterative interaction between the action specified by the optimal solution and the approximation of cost functions balances the exploitation of current knowledge and the need for exploration. The proposed method is shown to converge to the optimal policy for a controllable discrete time linear plant with unknown output parameters.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: IFAC-PapersOnLine - Volume 49, Issue 7, 2016, Pages 266–271
نویسندگان
, ,