1   Artículos

 
en línea
Yubing Mao, Farong Gao, Qizhong Zhang and Zhangyi Yang    
This study aims to solve the problem of sparse reward and local convergence when using a reinforcement learning algorithm as the controller of an AUV. Based on the generative adversarial imitation (GAIL) algorithm combined with a multi-agent, a multi-age... ver más
Revista: Journal of Marine Science and Engineering    Formato: Electrónico

« Anterior     Página: 1 de 1     Siguiente »