Optimal and Autonomous Control Using Reinforcement Learning: A Survey

Abstract
This paper reviews the current state of the art on reinforcement learning (RL)-based feedback control solutions to optimal regulation and tracking of single and multiagent systems. Existing RL solutions to both optimal H₂ and H∞$ control problems, as well as graphical games, will be reviewed. RL methods learn the solution to optimal control and game problems online and using measured data along the system trajectories. We discuss Q-learning and the integral RL algorithm as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively. Moreover, we discuss a new direction of off-policy RL for both CT and DT systems. Finally, we review several applications.
Funding Information
  • U.S. NSF (ECCS-1405173)
  • ONR (N00014-17-1-2239)
  • China NSFC (61633007)
  • NATO through the Virginia Tech Startup Fund (SPS G5176)