Algorithms for Reinforcement Learning

Abstract

Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long-term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive ca...

Cited by 329 articles