Stochastic linear quadratic optimal control for model-free discrete-time systems based on Q-learning algorithm

Article code	Journal code	Publication year	English article	Full-text version
6863525	1439515	2018	26-page PDF	Free download

Keywords

Q-learning; Well-posedness

Related topics

Engineering and Basic Sciences; Computer Engineering; Artificial Intelligence

Abstract

Solving the stochastic linear quadratic (SLQ) optimal control problem generally requires full information about the system dynamics. In this paper, a Q-learning iteration algorithm is adopted to solve the control problem for model-free discrete-time systems. First, a condition for the well-posedness of the SLQ problem is given, and the stochastic problem is transformed into a deterministic one so that it can be solved. Second, in the iteration process of the Q-learning algorithm, the H matrix sequence and the control gain matrix sequence are obtained without knowledge of the system parameters, and a convergence proof for both sequences is given. Finally, two simulation examples are provided to illustrate the effectiveness of the Q-learning algorithm.
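
The kind of iteration the abstract describes can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It assumes a noise-free linear system x_{k+1} = A x_k + B u_k (the stochastic terms of the SLQ problem are omitted), a quadratic stage cost with weights Q and R, and a standard value-iteration form of Q-learning in which the H matrix is fitted by least squares to sampled transitions. The function name `q_learning_lqr`, the simulator `step`, and the example matrices are hypothetical and chosen only for illustration.

```python
import numpy as np

def q_learning_lqr(step, n, m, Q, R, iters=200, samples=60, seed=0):
    """Value-iteration Q-learning for a noise-free linear system (illustrative).

    `step(x, u)` returns the next state. The learner only sees sampled
    transitions (x, u, x_next) and never reads the system matrices.
    Returns the last H matrix and the gain K of the policy u = K x.
    """
    rng = np.random.default_rng(seed)
    # Exploration data: random state/input pairs and the observed next states.
    X = rng.standard_normal((samples, n))
    U = rng.standard_normal((samples, m))
    Xn = np.array([step(x, u) for x, u in zip(X, U)])
    Z = np.hstack([X, U])

    # Quadratic features z_i * z_j for i <= j; off-diagonal terms count twice
    # in z^T H z, so they are scaled by 2.
    iu = np.triu_indices(n + m)
    Phi = Z[:, iu[0]] * Z[:, iu[1]] * np.where(iu[0] == iu[1], 1.0, 2.0)

    stage_cost = (np.einsum("ki,ij,kj->k", X, Q, X)
                  + np.einsum("ki,ij,kj->k", U, R, U))

    P = np.zeros((n, n))  # value matrix implied by the initial guess Q_0 = 0
    for _ in range(iters):
        # Bellman target: stage cost + min over u' of Q_i(x', u') = x'^T P_i x'.
        y = stage_cost + np.einsum("ki,ij,kj->k", Xn, P, Xn)
        theta, *_ = np.linalg.lstsq(Phi, y, rcond=None)
        # Rebuild the symmetric H matrix from its upper-triangular entries.
        H = np.zeros((n + m, n + m))
        H[iu] = theta
        H = H + H.T - np.diag(np.diag(H))
        Hxx, Hxu, Huu = H[:n, :n], H[:n, n:], H[n:, n:]
        K = -np.linalg.solve(Huu, Hxu.T)  # greedy gain, u = K x
        P = Hxx + Hxu @ K                 # equals Hxx - Hxu Huu^{-1} Hux
    return H, K

# Hypothetical example system; these matrices are hidden inside the simulator.
A_true = np.array([[0.9, 0.2], [0.0, 0.8]])
B_true = np.array([[0.0], [1.0]])

def step(x, u):
    return A_true @ x + B_true @ u

H, K = q_learning_lqr(step, n=2, m=1, Q=np.eye(2), R=np.eye(1))
print("learned gain K:", K)
```

In this noise-free setting the least-squares fit is exact, so the H and K sequences follow the usual Riccati value iteration. Handling the stochastic terms of the SLQ problem would need the transformation described in the abstract, which this sketch does not attempt.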

Publisher

Database: Elsevier - ScienceDirect
Journal: Neurocomputing - Volume 312, 27 October 2018, Pages 1-8

Authors

Tao Wang, Huaguang Zhang, Yanhong Luo
