On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning

No abstract available

This publication has 6 references indexed in Scilit:

A neuro-dynamic programming approach to retailer inventory management
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Mean-Field Theory for Batched TD(λ)
Neural Computation, 1997
An analysis of temporal-difference learning with function approximation
IEEE Transactions on Automatic Control, 1997
Discrete Stochastic Processes
Published by Springer Science and Business Media LLC ,1996
Adaptive Algorithms and Stochastic Approximations
Published by Springer Science and Business Media LLC ,1990
Functional Approximations and Dynamic Programming
Mathematical Tables and Other Aids to Computation, 1959