دانلود رایگان مقاله: جستجو در مورد آینه و شتاب آن

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6867101	1439836	2018	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Mirror descent search and its acceleration

ترجمه فارسی عنوان

جستجو در مورد آینه و شتاب آن

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Bregman divergence - واگرایی Bregman Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

چکیده انگلیسی

In recent years, attention has been focused on the relationship between black-box optimization problem and reinforcement learning problem. In this research, we propose the Mirror Descent Search (MDS) algorithm which is applicable both for black box optimization problems and reinforcement learning problems. Our method is based on the mirror descent method, which is a general optimization algorithm. The contribution of this research is roughly twofold. We propose two essential algorithms, called MDS and Accelerated Mirror Descent Search (AMDS), and two more approximate algorithms: Gaussian Mirror Descent Search (G-MDS) and Gaussian Accelerated Mirror Descent Search (G-AMDS). This research shows that the advanced methods developed in the context of the mirror descent research can be applied to reinforcement learning problem. We also clarify the relationship between an existing reinforcement learning algorithm and our method. With two evaluation experiments, we show our proposed algorithms converge faster than some state-of-the-art methods.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Robotics and Autonomous Systems - Volume 106, August 2018, Pages 107-116

نویسندگان

Megumi Miyashita, Shiro Yano, Toshiyuki Kondo,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : جستجو در مورد آینه و شتاب آن

دسترسی سریع

ارتباط

English Website