Robust ASV Navigation Through Ground to Water Cross-Domain Deep Reinforcement Learning
Open Access
- 20 September 2021
- journal article
- research article
- Published by Frontiers Media SA in Frontiers in Robotics and AI
Abstract
This paper presents a framework to alleviate the Deep Reinforcement Learning (DRL) training data sparsity problem that is present in challenging domains by creating a DRL agent training and vehicle integration methodology. The methodology leverages accessible domains to train an agent to solve navigational problems such as obstacle avoidance and allows the agent to generalize to challenging and inaccessible domains such as those present in marine environments with minimal further training. This is done by integrating a DRL agent at a high level of vehicle control and leveraging existing path planning and proven low-level control methodologies that are utilized in multiple domains. An autonomy package with a tertiary multilevel controller is developed to enable the DRL agent to interface at the prescribed high control level and thus be separated from vehicle dynamics and environmental constraints. An example Deep Q Network (DQN) employing this methodology for obstacle avoidance is trained in a simulated ground environment, and then its ability to generalize across domains is experimentally validated. Experimental validation utilized a simulated water surface environment and real-world deployment of ground and water robotic platforms. This methodology, when used, shows that it is possible to leverage accessible and data rich domains, such as ground, to effectively develop marine DRL agents for use on Autonomous Surface Vehicle (ASV) navigation. This will allow rapid and iterative agent development without the risk of ASV loss, the cost and logistic overhead of marine deployment, and allow landlocked institutions to develop agents for marine applications.Funding Information
- National Science Foundation (1921060)
- Office of Naval Research (N00014-20-1-2085)
This publication has 36 references indexed in Scilit:
- Review of Plant Identification Based on Image ProcessingArchives of Computational Methods in Engineering, 2016
- The Cityscapes Dataset for Semantic Urban Scene UnderstandingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2016
- A Machine Learning Approach to Visual Perception of Forest Trails for Mobile RobotsIEEE Robotics and Automation Letters, 2015
- Study on manoeuverability and control of an autonomous Wave Adaptive Modular Vessel (WAM-V) for ocean observationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- Human-level control through deep reinforcement learningNature, 2015
- Line-of-Sight Path Following for Dubins Paths With Adaptive Sideslip Compensation of Drift ForcesIEEE Transactions on Control Systems Technology, 2014
- MuJoCo: A physics engine for model-based controlPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Basic navigation, guidance and control of an Unmanned Surface VehicleAutonomous Robots, 2008
- SWORDFISH: an Autonomous Surface Vehicle for Network Centric OperationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- On Curves of Minimal Length with a Constraint on Average Curvature, and with Prescribed Initial and Terminal Positions and TangentsAmerican Journal of Mathematics, 1957