Yubing Mao, Farong Gao, Qizhong Zhang and Zhangyi Yang
This study aims to solve the problems of sparse rewards and local convergence that arise when a reinforcement learning algorithm is used as the controller of an autonomous underwater vehicle (AUV). Based on the generative adversarial imitation learning (GAIL) algorithm combined with a multi-agent, a multi-age...