دانلود رایگان مقاله: یک ترکیب برنامه ریزی خودکار و یادگیری تقویت برای تصمیم گیری کارآمد و قوی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4942156	1436991	2016	28 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

A synthesis of automated planning and reinforcement learning for efficient, robust decision-making

ترجمه فارسی عنوان

یک ترکیب برنامه ریزی خودکار و یادگیری تقویت برای تصمیم گیری کارآمد و قوی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

برنامه ریزی خودکار تقویت یادگیری، ربات مستقل، یادگیری ربات، پاسخ برنامه نویسی،

Automated planning - برنامه ریزی خودکار Autonomous robot - ربات مستقل Answer Set Programming - پاسخ تنظیم برنامه ریزی Reinforcement learning - یادگیری تقویتی Robot learning - یادگیری ربات

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

یک ترکیب برنامه ریزی خودکار و یادگیری تقویت برای تصمیم گیری کارآمد و قوی

چکیده انگلیسی

Automated planning and reinforcement learning are characterized by complementary views on decision making: the former relies on previous knowledge and computation, while the latter on interaction with the world, and experience. Planning allows robots to carry out different tasks in the same domain, without the need to acquire knowledge about each one of them, but relies strongly on the accuracy of the model. Reinforcement learning, on the other hand, does not require previous knowledge, and allows robots to robustly adapt to the environment, but often necessitates an infeasible amount of experience. We present Domain Approximation for Reinforcement LearnING (DARLING), a method that takes advantage of planning to constrain the behavior of the agent to reasonable choices, and of reinforcement learning to adapt to the environment, and increase the reliability of the decision making process. We demonstrate the effectiveness of the proposed method on a service robot, carrying out a variety of tasks in an office building. We find that when the robot makes decisions by planning alone on a given model it often fails, and when it makes decisions by reinforcement learning alone it often cannot complete its tasks in a reasonable amount of time. When employing DARLING, even when seeded with the same model that was used for planning alone, however, the robot can quickly learn a behavior to carry out all the tasks, improves over time, and adapts to the environment as it changes.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Artificial Intelligence - Volume 241, December 2016, Pages 103-130

نویسندگان

Matteo Leonetti, Luca Iocchi, Peter Stone,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : یک ترکیب برنامه ریزی خودکار و یادگیری تقویت برای تصمیم گیری کارآمد و قوی

دسترسی سریع

ارتباط

English Website