Robust ASV Navigation Through Ground to Water Cross-Domain Deep Reinforcement Learning

Open Access

20 September 2021

journal article
research article
Published by Frontiers Media SA in Frontiers in Robotics and AI

Abstract

This paper presents a framework to alleviate the Deep Reinforcement Learning (DRL) training data sparsity problem that is present in challenging domains by creating a DRL agent training and vehicle integration methodology. The methodology leverages accessible domains to train an agent to solve navigational problems such as obstacle avoidance and allows the agent to generalize to challenging and inaccessible domains such as those present in marine environments with minimal further training. This is done by integrating a DRL agent at a high level of vehicle control and leveraging existing path planning and proven low-level control methodologies that are utilized in multiple domains. An autonomy package with a tertiary multilevel controller is developed to enable the DRL agent to interface at the prescribed high control level and thus be separated from vehicle dynamics and environmental constraints. An example Deep Q Network (DQN) employing this methodology for obstacle avoidance is trained in a simulated ground environment, and then its ability to generalize across domains is experimentally validated. Experimental validation utilized a simulated water surface environment and real-world deployment of ground and water robotic platforms. This methodology, when used, shows that it is possible to leverage accessible and data rich domains, such as ground, to effectively develop marine DRL agents for use on Autonomous Surface Vehicle (ASV) navigation. This will allow rapid and iterative agent development without the risk of ASV loss, the cost and logistic overhead of marine deployment, and allow landlocked institutions to develop agents for marine applications.

Funding Information

National Science Foundation (1921060)
Office of Naval Research (N00014-20-1-2085)

This publication has 36 references indexed in Scilit:

Review of Plant Identification Based on Image Processing
Archives of Computational Methods in Engineering, 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2016
A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots
IEEE Robotics and Automation Letters, 2015
Study on manoeuverability and control of an autonomous Wave Adaptive Modular Vessel (WAM-V) for ocean observation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
Human-level control through deep reinforcement learning
Nature, 2015
Line-of-Sight Path Following for Dubins Paths With Adaptive Sideslip Compensation of Drift Forces
IEEE Transactions on Control Systems Technology, 2014
MuJoCo: A physics engine for model-based control
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Basic navigation, guidance and control of an Unmanned Surface Vehicle
Autonomous Robots, 2008
SWORDFISH: an Autonomous Surface Vehicle for Network Centric Operations
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
On Curves of Minimal Length with a Constraint on Average Curvature, and with Prescribed Initial and Terminal Positions and Tangents
American Journal of Mathematics, 1957

Cited by 2 articles