Redirigiendo al acceso original de articulo en 24 segundos...
ARTÍCULO
TITULO

Path Planning of Coastal Ships Based on Optimized DQN Reward Function

Siyu Guo    
Xiuguo Zhang    
Yiquan Du    
Yisong Zheng and Zhiying Cao    

Resumen

Path planning is a key issue in the field of coastal ships, and it is also the core foundation of ship intelligent development. In order to better realize the ship path planning in the process of navigation, this paper proposes a coastal ship path planning model based on the optimized deep Q network (DQN) algorithm. The model is mainly composed of environment status information and the DQN algorithm. The environment status information provides training space for the DQN algorithm and is quantified according to the actual navigation environment and international rules for collision avoidance at sea. The DQN algorithm mainly includes four components which are ship state space, action space, action exploration strategy and reward function. The traditional reward function of DQN may lead to the low learning efficiency and convergence speed of the model. This paper optimizes the traditional reward function from three aspects: (a) the potential energy reward of the target point to the ship is set; (b) the reward area is added near the target point; and (c) the danger area is added near the obstacle. Through the above optimized method, the ship can avoid obstacles to reach the target point faster, and the convergence speed of the model is accelerated. The traditional DQN algorithm, A* algorithm, BUG2 algorithm and artificial potential field (APF) algorithm are selected for experimental comparison, and the experimental data are analyzed from the path length, planning time, number of path corners. The experimental results show that the optimized DQN algorithm has better stability and convergence, and greatly reduces the calculation time. It can plan the optimal path in line with the actual navigation rules, and improve the safety, economy and autonomous decision-making ability of ship navigation.

 Artículos similares

       
 
Chuanwei Zhang, Xinyue Yang, Rui Zhou and Zhongyu Guo    
In order to solve the problem of low safety and efficiency of underground mine vehicles, a path planning method for underground mine vehicles based on an improved A star (A*) and fuzzy control Dynamic Window Approach (DWA) is proposed. Firstly, the envir... ver más
Revista: Applied Sciences

 
Zilin Zhao, Zhi Cai, Mengmeng Chang and Zhiming Ding    
Unconventional events exacerbate the imbalance between regional transportation demand and limited road network resources. Scientific and efficient path planning serves as the foundation for rapidly restoring equilibrium to the road network. In real large... ver más
Revista: Applied Sciences

 
Siyao Lu, Rui Xu, Zhaoyu Li, Bang Wang and Zhijun Zhao    
The International Lunar Research Station, to be established around 2030, will equip lunar rovers with robotic arms as constructors. Construction requires lunar soil and lunar rovers, for which rovers must go toward different waypoints without encounterin... ver más
Revista: Aerospace

 
Chenglou Liu, Fangfang Xie and Tingwei Ji    
Formation path planning is a significant cornerstone for unmanned aerial vehicle (UAV) swarm intelligence. Previous methods were not suitable for large-scale UAV formation, which suffered from poor formation maintenance and low planning efficiency. To th... ver más
Revista: Aerospace

 
Chaopeng Yang, Jiacai Pan, Kai Wei, Mengjie Lu and Shihao Jia    
Ocean currents make it difficult for unmanned surface vehicles (USVs) to keep a safe distance from obstacles. Effective path planning should adequately consider the effect of ocean currents on USVs. This paper proposes an improved A* algorithm based on a... ver más