Multi-UAV Cooperative Task Assignment Based on Half Random Q-Learning

Open Access

14 December 2021

journal article
research article
Published by MDPI AG in Symmetry

Vol. 13 (12), 2417
https://doi.org/10.3390/sym13122417

Abstract

Unmanned aerial vehicle (UAV) clusters usually face problems such as complex environments, heterogeneous combat subjects, and realistic interference factors in the course of mission assignment. In order to reduce resource consumption and improve the task execution rate, it is very important to develop a reasonable allocation plan for the tasks. Therefore, this paper constructs a heterogeneous UAV multitask assignment model based on several realistic constraints and proposes an improved half-random Q-learning (HR Q-learning) algorithm. The algorithm is based on the Q-learning algorithm under reinforcement learning, and by changing the way the Q-learning algorithm selects the next action in the process of random exploration, the probability of obtaining an invalid action in the random case is reduced, and the exploration efficiency is improved, thus increasing the possibility of obtaining a better assignment scheme, this also ensures symmetry and synergy in the distribution process of the drones. Simulation experiments show that compared with Q-learning algorithm and other heuristic algorithms, HR Q-learning algorithm can improve the performance of task execution, including the ability to improve the rationality of task assignment, increasing the value of gains by 12.12%, this is equivalent to an average of one drone per mission saved, and higher success rate of task execution. This improvement provides a meaningful attempt for UAV task assignment.

Keywords

This publication has 36 references indexed in Scilit:

Modeling and simulation of dynamic ant colony’s labor division for task allocation of UAV swarm
Physica A: Statistical Mechanics and its Applications, 2018
Multi-UAV reconnaissance task allocation for heterogeneous targets using an opposition-based genetic algorithm with double-chromosome encoding
Chinese Journal of Aeronautics, 2018
Unmanned aerial vehicle routing in the presence of threats
Computers & Industrial Engineering, 2018
A reinforcement learning approach to parameter estimation in dynamic job shop scheduling
Computers & Industrial Engineering, 2017
Reinforcement learning improves behaviour from evaluative feedback
Nature, 2015
An Operation-Time Simulation Framework for UAV Swarm Configuration and Mission Planning
Procedia Computer Science, 2013
Heuristic algorithms for assigning and scheduling flight missions in a military aviation unit
Computers & Industrial Engineering, 2011
Multi-heuristic dynamic task allocation using genetic algorithms in a heterogeneous distributed system
Journal of Parallel and Distributed Computing, 2010
Intrinsically Motivated Reinforcement Learning: An Evolutionary Perspective
IEEE Transactions on Autonomous Mental Development, 2010
Reinforcement Learning: An Introduction
IEEE Transactions on Neural Networks, 1998

Cited by 9 articles