Incremental Multi-Step Q-Learning

Publisher Website

1 January 1994

book chapter
Published by Elsevier BV

p. 226-232
https://doi.org/10.1016/b978-1-55860-335-6.50035-0

Abstract

No abstract available

Keywords

This publication has 8 references indexed in Scilit:

Prioritized sweeping: Reinforcement learning with less data and less time
Machine Learning, 1993
On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
Published by Defense Technical Information Center (DTIC) ,1993
Efficient Learning and Planning Within the Dyna Framework
Adaptive Behavior, 1993
The convergence of TD(?) for general ?
Machine Learning, 1992
Q-learning
Machine Learning, 1992
Consistency of HDP applied to a simple reinforcement learning problem
Neural Networks, 1990
Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
Published by Elsevier BV ,1990
Learning to predict by the methods of temporal differences
Machine Learning, 1988

Cited by 46 articles