Exploitation-Oriented Learning with Deep Learning – Introducing Profit Sharing to a Deep Q-Network –

Open Access

Abstract

Deep learning is currently attracting significant interest. Deep Q-networks (DQNs), which combine deep learning with Q-learning, have produced excellent results on several Atari 2600 games. In this paper, we propose an exploitation-oriented learning (XoL) method that incorporates deep learning to reduce the number of trial-and-error searches. We focus on profit sharing (PS), an XoL method, and combine it with a DQN to obtain the DQNwithPS method. We compare DQNwithPS with a DQN on Atari 2600 games and demonstrate that it learns stably with fewer trial-and-error searches than a DQN alone requires.
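
The abstract does not state the update rule, so the following is only a rough illustration of the general profit-sharing idea, not the paper's DQNwithPS algorithm. When an episode ends with a reward, profit sharing distributes that reward backward over the state-action pairs that led to it, with earlier pairs receiving less credit. The tabular setting, the function name `profit_sharing_update`, the geometric `decay` ratio, and the toy episode are all assumptions made for this sketch.

```python
from collections import defaultdict


def profit_sharing_update(weights, episode, reward, decay=0.5):
    """Distribute `reward` backward over the (state, action) pairs of one episode.

    Assumption: credit shrinks geometrically by `decay` for each step
    further from the reward. This is a tabular sketch, not DQNwithPS.
    """
    credit = reward
    for state, action in reversed(episode):
        weights[(state, action)] += credit
        credit *= decay
    return weights


# Hypothetical three-step episode that ends with a reward of 1.0.
weights = defaultdict(float)
episode = [("s0", "right"), ("s1", "right"), ("s2", "up")]
profit_sharing_update(weights, episode, reward=1.0, decay=0.5)
```

In this sketch the pair closest to the reward receives the full reward and each earlier pair receives half of the credit given to its successor. Because a whole rewarded episode is reinforced at once, methods of this kind aim to make use of each successful trial, which is the motivation the abstract gives for reducing trial-and-error searches.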
