To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning

Publisher Website

1 January 1994

conference paper
Published by Elsevier BV

p. 164-172
https://doi.org/10.1016/b978-1-55860-335-6.50028-3

Abstract

No abstract available

Keywords

REINFORCEMENT LEARNING

This publication has 6 references indexed in Scilit:

Computationally efficient adaptive control algorithms for Markov chains
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A Reinforcement Learning Method for Maximizing Undiscounted Rewards
Published by Elsevier BV ,1993
Automatic programming of behavior-based robots using reinforcement learning
Artificial Intelligence, 1992
Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
Published by Elsevier BV ,1990
Learning to predict by the methods of temporal differences
Machine Learning, 1988
Neuronlike adaptive elements that can solve difficult learning control problems
IEEE Transactions on Systems, Man, and Cybernetics, 1983

Cited by 13 articles