دانلود رایگان مقاله: یادگیری تقویتی برای کنترل تطبیقی قوی از سیستم های غیرخطی ناشناخته با توجه به عدم اطمینان بی نظیر

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6856225	1437949	2018	16 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties

ترجمه فارسی عنوان

یادگیری تقویتی برای کنترل تطبیقی قوی از سیستم های غیرخطی ناشناخته با توجه به عدم اطمینان بی نظیر

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

برنامه ریزی پویا سازگار، شبکه های عصبی، کنترل بهینه، تقویت یادگیری، کنترل قوی، عدم اطمینان بی نظیر،

adaptive dynamic programming - برنامه ریزی پویا تطبیقی Neural networks - شبکه های عصبی Optimal control - کنترل بهینه Robust control - کنترل قوی Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

یادگیری تقویتی برای کنترل تطبیقی قوی از سیستم های غیرخطی ناشناخته با توجه به عدم اطمینان بی نظیر

چکیده انگلیسی

This paper proposes a novel robust adaptive control strategy for partially unknown continuous-time nonlinear systems subject to unmatched uncertainties. Initially, the robust nonlinear control problem is converted into a nonlinear optimal control problem by constructing an appropriate value function for the auxiliary system. After that, within the framework of reinforcement learning, an identifier-critic architecture is developed. The presented architecture uses two neural networks: the identifier neural network (INN) which aims at estimating the unknown internal dynamics and the critic neural network (CNN) which tends to derive the approximate solution of the Hamilton-Jacobi-Bellman equation arising in the obtained optimal control problem. The INN is updated by using both the back-propagation algorithm and the e-modification technique. Meanwhile, the CNN is updated via the modified gradient descent method, which uses historical and current state data simultaneously. Based on the classic Lyapunov technique, all the signals in the closed-loop auxiliary system are proved to be uniformly ultimately bounded. Moreover, the original system is kept asymptotically stable under the obtained approximate optimal control. Finally, two illustrative examples, including the F-16 aircraft plant, are provided to demonstrate the effectiveness of the developed method.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volumes 463â464, October 2018, Pages 307-322

نویسندگان

Xiong Yang, Haibo He, Qinglai Wei, Biao Luo,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : یادگیری تقویتی برای کنترل تطبیقی قوی از سیستم های غیرخطی ناشناخته با توجه به عدم اطمینان بی نظیر

دسترسی سریع

ارتباط

English Website