Redirigiendo al acceso original de articulo en 17 segundos...
ARTÍCULO
TITULO

An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

Bowen Xing    
Xiao Wang    
Liu Yang    
Zhenchong Liu and Qingyun Wu    

Resumen

A deep reinforcement learning method to achieve complete coverage path planning for an unmanned surface vehicle (USV) is proposed. This paper firstly models the USV and the workspace required for complete coverage. Then, for the full-coverage path planning task, this paper proposes a preprocessing method for raster maps, which can effectively delete the blank areas that are impossible to cover in the raster map. In this paper, the state matrix corresponding to the preprocessed raster map is used as the input of the deep neural network. The deep Q network (DQN) is used to train the complete coverage path planning strategy of the agent. The improvement of the selection of random actions during training is first proposed. Considering the task of complete coverage path planning, this paper replaces random actions with a set of actions toward the nearest uncovered grid. To solve the problem of the slow convergence speed of the deep reinforcement learning network in full-coverage path planning, this paper proposes an improved method of deep reinforcement learning, which superimposes the final output layer with a dangerous actions matrix to reduce the risk of selection of dangerous actions of USVs during the learning process. Finally, the designed method validates via simulation examples.

 Artículos similares

       
 
Yiming Mo, Lei Wang, Wenqing Hong, Congzhen Chu, Peigen Li and Haiting Xia    
The intrusion of foreign objects on airport runways during aircraft takeoff and landing poses a significant safety threat to air transportation. Small-scale Foreign Object Debris (FOD) cannot be ruled out on time by traditional manual inspection, and the... ver más
Revista: Applied Sciences

 
Jun Li, Javed Iqbal Tanoli, Miao Zhou and Filip Gurkalo    
Based on an improved genetic algorithm and debris flow disaster monitoring network, this study examines the monitoring and early warning method of debris flow expansion behavior, divides the risk of debris flow disaster, and provides a scientific basis f... ver más
Revista: Water

 
Huang Zhang, Ting Huang, Fangguo Zhang, Baodian Wei and Yusong Du    
A bilinear map whose domain and target sets are identical is called a self-bilinear map. Original self-bilinear maps are defined over cyclic groups. Since the map itself reveals information about the underlying cyclic group, the Decisional Diffie?Hellman... ver más
Revista: Information

 
Xianyong Jing, Manyi Hou, Wei Li, Cui Chen, Zhishu Feng and Mingwei Wang    
When Unmanned Aerial Vehicles (UAVs) are used in search and rescue operations, electro-optical (EO) devices are usually used as the detection equipment, and area coverage is used as the main search method. However, the sector scanning mode of EO puts for... ver más
Revista: Aerospace

 
Yongjiang Mao, Wenjuan Ren, Xipeng Li, Zhanpeng Yang and Wei Cao    
With the progress of signal processing technology and the emergence of new system radars, the space electromagnetic environment becomes more and more complex, which puts forward higher requirements for the deinterleaving method of radar signals. Traditio... ver más
Revista: Applied Sciences