An AUV Target-Tracking Method Combining Imitation Learning and Deep Reinforcement Learning

Yubing Mao

Farong Gao

Qizhong Zhang and Zhangyi Yang

Resumen

This study aims to solve the problem of sparse reward and local convergence when using a reinforcement learning algorithm as the controller of an AUV. Based on the generative adversarial imitation (GAIL) algorithm combined with a multi-agent, a multi-agent GAIL (MAG) algorithm is proposed. The GAIL enables the AUV to directly learn from expert demonstrations, overcoming the difficulty of slow initial training of the network. Parallel training of multi-agents reduces the high correlation between samples to avoid local convergence. In addition, a reward function is designed to help training. Finally, the results show that in the unity simulation platform test, the proposed algorithm has a strong optimal decision-making ability in the tracking process.

Palabras claves

imitation learning - deep reinforcement learning - multi-agent - underwater unmanned autonomous robot - target tracking

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 10 Parte: 3 (2022)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL

DOI

https://doi.org/10.3390/jmse10030383

An AUV Target-Tracking Method Combining Imitation Learning and Deep Reinforcement Learning

Revistas destacadas