A Reinforcement Learning System to Dynamic Movement and Multi-Layer Environments

Open Access

1 January 2014

journal article
Published by Scientific Research Publishing, Inc. in Journal of Intelligent Learning Systems and Applications

Vol. 06 (04), 176-185
https://doi.org/10.4236/jilsa.2014.64014

Abstract

Many policy-improvement systems have been proposed for Reinforcement Learning (RL) agents that adapt quickly to environmental change by using statistical methods such as mixture models of Bayesian networks, mixture probability, and clustering distribution. However, such methods increase computational complexity. In addition, these systems need to adapt well to more complex settings, such as multi-layer environments. In this study, we used the profit-sharing method for the agent to learn its policy, and added a mixture probability to the RL system so that it recognizes changes in the environment and improves the agent’s policy accordingly. We also introduced a clustering step that enables a smaller, suitable selection, reducing the computational complexity while maintaining the system’s performance. The experimental results showed that the agent successfully learned the policy and adjusted efficiently to changes in a multi-layer environment. Finally, the proposed system kept both the computational complexity and the decline in the effectiveness of policy improvement under control.
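
The abstract does not spell out the learning rule, so the following is only a minimal sketch of a generic profit-sharing update, not the authors' implementation. It assumes a geometrically decaying credit function, and the names `profit_sharing_update`, `action_probabilities`, and `decay` are hypothetical. When an episode ends with a reward, each (state, action) pair visited in the episode receives a share of that reward, with pairs closer to the reward receiving more.

```python
from collections import defaultdict


def profit_sharing_update(weights, episode, reward, decay=0.5):
    """Distribute `reward` backward over the episode's (state, action) pairs.

    Assumption: credit decays geometrically by `decay` per step away from
    the reward. The abstract does not specify the credit function.
    """
    credit = reward
    for state, action in reversed(episode):
        weights[(state, action)] += credit
        credit *= decay
    return weights


def action_probabilities(weights, state, actions):
    """Choose actions in proportion to their accumulated weights."""
    total = sum(weights[(state, a)] for a in actions)
    if total == 0:
        # No experience for this state yet: fall back to a uniform policy.
        return {a: 1.0 / len(actions) for a in actions}
    return {a: weights[(state, a)] / total for a in actions}


# Hypothetical three-step episode that ends with a reward of 1.0.
weights = defaultdict(float)
episode = [("s0", "right"), ("s1", "right"), ("s2", "up")]
profit_sharing_update(weights, episode, reward=1.0, decay=0.5)
probs = action_probabilities(weights, "s0", ["left", "right"])
```

With these assumed values, the last pair in the episode receives the full reward and each earlier pair receives half the credit of the pair that follows it. The mixture probability and clustering components mentioned in the abstract are not modeled here.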
