Experimental Study on Behavior Acquisition of Mobile Robot by Deep Q-Network
- 20 September 2017
- journal article
- Published by Fuji Technology Press Ltd. in Journal of Advanced Computational Intelligence and Intelligent Informatics
- Vol. 21 (5), 840-848
- https://doi.org/10.20965/jaciii.2017.p0840
Abstract
Deep Q-network (DQN) is one of the best-known methods of deep reinforcement learning. DQN approximates the action-value function with a convolutional neural network (CNN) and updates it using Q-learning. In this study, we applied DQN to robot behavior learning in a simulation environment. We constructed the simulation environment for a two-wheeled mobile robot using the robot simulation software Webots. The mobile robot acquired good behaviors, such as avoiding walls and moving along a center line, by learning from high-dimensional visual information supplied as input data. We propose a method that reuses the best target network obtained so far when learning performance suddenly drops. Moreover, we incorporate the Profit Sharing method into DQN to accelerate learning. Through simulation experiments, we confirmed that our method is effective.