BIPED WALKING PATTERN GENERATION USING REINFORCEMENT LEARNING

1 March 2009

journal article
Published by World Scientific Pub Co Pte Ltd in International Journal of Humanoid Robotics

Vol. 6 (1), 1-21
https://doi.org/10.1142/s021984360900167x

Abstract

In this research, a stable biped walking pattern is generated using reinforcement learning. The walking pattern is modeled as a simple third-order polynomial, which requires four boundary conditions to be fully determined. The initial position and velocity and the final position and velocity of the joint are selected as these boundary conditions. A reinforcement learning algorithm is used to find suitable boundary condition values. The initial and final positions can also be used to achieve a desired motion or posture, and the final velocity of the walking pattern is chosen as the learning parameter. To test the algorithm, a simulator that accounts for the full model of the robot and the environment is developed, and the algorithm is verified in simulation.
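
As an illustration of the third-order polynomial described in the abstract, the following minimal sketch solves for the four coefficients of a cubic joint trajectory from the four boundary conditions (initial and final position and velocity). The function names, the step duration `T`, and the numeric values are assumptions made for this example and are not taken from the article; the reinforcement learning step itself is not shown, only the place where a learned final velocity `vf` would enter.

```python
def cubic_coefficients(q0, v0, qf, vf, T):
    """Coefficients (a0, a1, a2, a3) of q(t) = a0 + a1*t + a2*t**2 + a3*t**3
    satisfying q(0)=q0, q'(0)=v0, q(T)=qf, q'(T)=vf.

    Hypothetical helper for illustration; not the article's implementation.
    """
    if T <= 0:
        raise ValueError("T must be positive")
    d = qf - q0
    a0 = q0
    a1 = v0
    a2 = (3.0 * d - (2.0 * v0 + vf) * T) / T**2
    a3 = (-2.0 * d + (v0 + vf) * T) / T**3
    return a0, a1, a2, a3


def evaluate(coeffs, t):
    """Return (position, velocity) of the cubic trajectory at time t."""
    a0, a1, a2, a3 = coeffs
    position = a0 + a1 * t + a2 * t**2 + a3 * t**3
    velocity = a1 + 2.0 * a2 * t + 3.0 * a3 * t**2
    return position, velocity


if __name__ == "__main__":
    # Assumed example values: joint angle in radians, duration in seconds.
    # vf stands in for the parameter a learning algorithm would adjust.
    coeffs = cubic_coefficients(q0=0.0, v0=0.0, qf=1.0, vf=0.5, T=2.0)
    print(evaluate(coeffs, 0.0))
    print(evaluate(coeffs, 2.0))
```

In this sketch, fixing `q0`, `v0`, and `qf` leaves `vf` as the single free value that shapes the trajectory, which mirrors the abstract's choice of the final velocity as the learning parameter.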

Cited by 9 articles