کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4948643 1439844 2017 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Posture self-stabilizer of a biped robot based on training platform and reinforcement learning
ترجمه فارسی عنوان
خود تثبیت کننده وضعیت یک ربات دوتایی بر اساس پلت فرم آموزش و یادگیری تقویت شده است
کلمات کلیدی
سیستم های یادگیری و تطبیقی روبات های ساق پا، رباتیک تکاملی آموزش پایداری، انتزاع خودکار حالت دولت، خود تثبیت کننده،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
In order to solve the problem of stability control for biped robots, the concept of stability training is proposed by using a training platform to exert random disturbance with amplitude limitation on robots that are to be trained. In this work, an approach to achieve a posture stabilizing capability based on stability training and reinforcement learning is explored and verified by simulations. An automatic abstraction method for state space is proposed by using the Gauss basis function and inner evaluation indexes to speed up the learning process. Hierarchical structure stabilizer using the Monte Carlo method is designed according to the concept of variable ZMP. Training samples are extracted from the state transition of the stability training process using balance controllers based on the robot dynamic model. The stabilizers are trained with and without applying the automatic abstraction of state space. Then simulation tests of them are conducted under conditions where the training platform exerts amplitude-limited random disturbances on the robot. Also, the influence of the model errors is studied by introducing deviations of the CoM position during the simulation tests. By comparing the simulation results of two learning stabilizers and the model-based balance controller, it is demonstrated that the designed stabilizer can achieve approximate success rate of the ideal model-based balance controller and exert all the driving ability of the robot under the large disturbance condition of ±30° inclination of the platform. Also, the effects of the model error can be overcome by retraining using state transition data with the model error.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Robotics and Autonomous Systems - Volume 98, December 2017, Pages 42-55
نویسندگان
, ,