Online learning of shaping rewards in reinforcement learning
- 31 May 2010
- journal article
- Published by Elsevier BV in Neural Networks
- Vol. 23 (4), 541-550
- https://doi.org/10.1016/j.neunet.2010.01.001
Abstract
No abstract availableKeywords
This publication has 16 references indexed in Scilit:
- Qualitative reinforcement learningPublished by Association for Computing Machinery (ACM) ,2006
- Behavior transfer for value-function-based reinforcement learningPublished by Association for Computing Machinery (ACM) ,2005
- Shaping as a method for accelerating reinforcement learningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Layered LearningLecture Notes in Computer Science, 2000
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learningArtificial Intelligence, 1999
- TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level PlayNeural Computation, 1994
- Reward Functions for Accelerated LearningPublished by Elsevier BV ,1994
- Hierarchical Learning in Stochastic Domains: Preliminary ResultsPublished by Elsevier BV ,1993
- An optimal one-way multigrid algorithm for discrete-time stochastic controlIEEE Transactions on Automatic Control, 1991
- Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic ProgrammingPublished by Elsevier BV ,1990