Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
413451	680513	2011	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Dynamic Movement Primitives - اولیه اولیه حرکت پویا Value function approximation - تقریب تابع ارزش Reinforcement learning - یادگیری تقویتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives

چکیده انگلیسی

When describing robot motion with dynamic movement primitives (DMPs), goal (trajectory endpoint), shape and temporal scaling parameters are used. In reinforcement learning with DMPs, usually goals and temporal scaling parameters are predefined and only the weights for shaping a DMP are learned. Many tasks, however, exist where the best goal position is not a priori known, requiring to learn it. Thus, here we specifically address the question of how to simultaneously combine goal and shape parameter learning. This is a difficult problem because learning of both parameters could easily interfere in a destructive way. We apply value function approximation techniques for goal learning and direct policy search methods for shape learning. Specifically, we use “policy improvement with path integrals” and “natural actor critic” for the policy search. We solve a learning-to-pour-liquid task in simulations as well as using a Pa10 robot arm. Results for learning from scratch, learning initialized by human demonstration, as well as for modifying the tool for the learned DMPs are presented. We observe that the combination of goal and shape learning is stable and robust within large parameter regimes. Learning converges quickly even in the presence of disturbances, which makes this combined method suitable for robotic applications.

► Combination of goal and shape learning for dynamic movement primitives.
► Combination of value function approximation and direct policy search methods.
► Comparison of direct policy search methods.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Robotics and Autonomous Systems - Volume 59, Issue 11, November 2011, Pages 910–922

نویسندگان

Minija Tamosiunaite, Bojan Nemec, Aleš Ude, Florentin Wörgötter,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives

دسترسی سریع

ارتباط

English Website