Free article download: Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

Article code	Journal code	Publication year	English article	Full text
10398677	890302	2014	10 pages (PDF)	Free download

English title of the ISI article

Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

Persian translation of the title

یادگیری تقویت یکپارچه و بازیابی تجربه برای کنترل بهینه سازگار با سیستم های مداوم زمان ورودی محدود و ناشناخته


Keywords

Integral reinforcement learning, experience replay, optimal control, neural networks, input constraints

Related topics

Engineering and Basic Sciences; Other Engineering Disciplines; Control and Systems Engineering

Article preview

Abstract

In this paper, an integral reinforcement learning (IRL) algorithm on an actor-critic structure is developed to learn online the solution to the Hamilton-Jacobi-Bellman equation for partially-unknown constrained-input systems. The technique of experience replay is used to update the critic weights to solve an IRL Bellman equation. This means, unlike existing reinforcement learning algorithms, recorded past experiences are used concurrently with current data for adaptation of the critic weights. It is shown that using this technique, instead of the traditional persistence of excitation condition which is often difficult or impossible to verify online, an easy-to-check condition on the richness of the recorded data is sufficient to guarantee convergence to a near-optimal control law. Stability of the proposed feedback control law is shown and the effectiveness of the proposed method is illustrated with simulation examples.
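The role of the recorded data in the abstract can be illustrated with a small sketch. It is not the paper's implementation: it assumes a linear-in-parameters critic V(x) ≈ W·φ(x), so that each sample reduces to a pair (Δφ, p), where Δφ = φ(x(t)) − φ(x(t−T)) and p is the reward integrated over the interval, and the IRL Bellman error is e = W·Δφ + p. The function names, the sample layout, the normalized gradient rule, and the toy data are all assumptions made for illustration; the actor update and the input-constraint handling are omitted. Under these assumptions, the "easy-to-check" richness condition becomes a rank test on the stored Δφ vectors.

```python
import numpy as np


def is_rich_enough(recorded_dphi):
    """Check that the recorded regressors contain N linearly independent vectors.

    recorded_dphi: list of length-N arrays, one per stored sample (assumed layout).
    """
    Z = np.column_stack(recorded_dphi)            # shape (N, number_of_samples)
    return np.linalg.matrix_rank(Z) == Z.shape[0]


def replay_critic_step(W, current, memory, lr=0.5):
    """One normalized gradient step on the current and the recorded Bellman errors.

    Each sample is a pair (dphi, p); its Bellman error is e = W @ dphi + p.
    """
    grad = np.zeros_like(W)
    for dphi, p in [current, *memory]:
        e = W @ dphi + p                          # Bellman error for this sample
        grad += dphi * e / (1.0 + dphi @ dphi) ** 2
    return W - lr * grad


# Toy data, invented for illustration: all Bellman errors vanish at W_true.
W_true = np.array([1.0, 0.5])
memory = [(d, -(W_true @ d)) for d in (np.array([1.0, 0.0]), np.array([0.0, 1.0]))]
current = (np.array([1.0, 1.0]), -1.5)

W = np.zeros(2)
if is_rich_enough([d for d, _ in memory]):
    for _ in range(500):
        W = replay_critic_step(W, current, memory)
print(W)
```

Because the stored samples already span the weight space, the update keeps reducing the error in every direction even if the current sample stops being informative, which is the intuition behind replacing a persistence of excitation requirement with a check on the memory.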

Publisher

Database: Elsevier - ScienceDirect
Journal: Automatica - Volume 50, Issue 1, January 2014, Pages 193-202

Authors

Hamidreza Modares, Frank L. Lewis, Mohammad-Bagher Naghibi-Sistani
