Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging

Open Access

4 February 2021

journal article
research article
Published by MDPI AG in Sensors

Vol. 21 (4), 1076
https://doi.org/10.3390/s21041076

Abstract

Unmanned aerial vehicles (UAVs) have been widely used in search and rescue (SAR) missions due to their high flexibility. A key problem in SAR missions is to search and track moving targets in an area of interest. In this paper, we focus on the problem of Cooperative Multi-UAV Observation of Multiple Moving Targets (CMUOMMT). In contrast to the existing literature, we not only optimize the average observation rate of the discovered targets, but we also emphasize the fairness of the observation of the discovered targets and the continuous exploration of the undiscovered targets, under the assumption that the total number of targets is unknown. To achieve this objective, a deep reinforcement learning (DRL)-based method is proposed under the Partially Observable Markov Decision Process (POMDP) framework, where each UAV maintains four observation history maps, and maps from different UAVs within a communication range can be merged to enhance UAVs’ awareness of the environment. A deep convolutional neural network (CNN) is used to process the merged maps and generate the control commands to UAVs. The simulation results show that our policy can enable UAVs to balance between giving the discovered targets a fair observation and exploring the search region compared with other methods.

Keywords

This publication has 22 references indexed in Scilit:

Chaos-enhanced mobility models for multilevel swarms of UAVs
Swarm and Evolutionary Computation, 2018
Mastering the game of Go without human knowledge
Nature, 2017
Cooperative Robots to Observe Moving Targets: Review
IEEE Transactions on Cybernetics, 2016
A cooperative multi-robot team for the surveillance of shipwreck survivors at sea
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2016
Multi-UAVs tracking target in urban environment by model predictive control and Improved Grey Wolf Optimizer
Aerospace Science and Technology, 2016
Human-level control through deep reinforcement learning
Nature, 2015
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2014
Multi-Agent Cooperative Target Search
Sensors, 2014
Cooperative Observation of Multiple Moving Targets: an algorithm and its formalization
The International Journal of Robotics Research, 2007
Distributed Algorithms for Multi-Robot Observation of Multiple Moving Targets
Autonomous Robots, 2002

Cited by 5 articles