Aerospace, Vol. 10, No. 7 (2023)
ARTICLE
TITLE

A Reinforcement Learning Method Based on an Improved Sampling Mechanism for Unmanned Aerial Vehicle Penetration

Yue Wang    
Kexv Li    
Xing Zhuang    
Xinyu Liu and Hanyu Li    

Abstract

Penetration by unmanned aerial vehicles (UAVs) is an important aspect of UAV games. In recent years, UAV penetration has generally been addressed with artificial intelligence methods such as reinforcement learning. However, the high sample demand of reinforcement learning poses a significant challenge in the context of UAV games. To improve sample utilization in UAV penetration, this paper proposes an improved sampling mechanism called task completion division (TCD) and combines it with the soft actor-critic (SAC) algorithm to form the TCD-SAC algorithm. To compare TCD-SAC with related baseline algorithms, the study builds a dynamic UAV game environment and conducts training and testing experiments in it. The results show that, among all the algorithms compared, TCD-SAC achieves the highest sample utilization rate and the best penetration results, and that it adapts well and remains robust in dynamic environments.
