دانلود رایگان مقاله: کنترل ردیابی بهینه برای سیستم های پرش مارکوف کاملا غیرمعمول ناشناخته زمان گسسته با استفاده از روش یادگیری تقویتی مبتنی بر داده ها

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
408306	679017	2016	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method

ترجمه فارسی عنوان

کنترل ردیابی بهینه برای سیستم های پرش مارکوف کاملا غیرمعمول ناشناخته زمان گسسته با استفاده از روش یادگیری تقویتی مبتنی بر داده ها

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

کنترل ردیابی بهینه، سیستم پرش مارکوف، مبتنی بر داده ها، تقویت یادگیری، برنامه ریزی پویا سازگار، شبکه های عصبی

Data-based adaptive dynamic programming - برنامه ریزی پویا تطبیقی Markov jump systems - سیستم پرش مارکوف Neural networks - شبکه های عصبی Optimal tracking control - کنترل ردیابی بهینه Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

کنترل ردیابی بهینه برای سیستم های پرش مارکوف کاملا غیرمعمول ناشناخته زمان گسسته با استفاده از روش یادگیری تقویتی مبتنی بر داده ها

چکیده انگلیسی

In this paper, we develop a novel optimal tracking control scheme for a class of nonlinear discrete-time Markov jump systems (MJSs) by utilizing a data-based reinforcement learning method. It is not practical to obtain accurate system models of the real-world MJSs due to the existence of abrupt variations in their system structures. Consequently, most traditional model-based methods for MJSs are invalid for the practical engineering applications. In order to overcome the difficulties without any identification scheme which would cause estimation errors, a model-free adaptive dynamic programming (ADP) algorithm will be designed by using system data rather than accurate system functions. Firstly, we combine the tracking error dynamics and reference system dynamics to form an augmented system. Then, based on the augmented system, a new performance index function with discount factor is formulated for the optimal tracking control problem via Markov chain and weighted sum technique. Neural networks are employed to implement the on-line ADP learning algorithm. Finally, a simulation example is given to demonstrate the effectiveness of our proposed approach.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 194, 19 June 2016, Pages 176–182

نویسندگان

He Jiang, Huaguang Zhang, Yanhong Luo, Junyi Wang,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : کنترل ردیابی بهینه برای سیستم های پرش مارکوف کاملا غیرمعمول ناشناخته زمان گسسته با استفاده از روش یادگیری تقویتی مبتنی بر داده ها

دسترسی سریع

ارتباط

English Website