Journal: Drones, Vol. 7, No. 5 (2023)
ARTICLE
TITLE

Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs

Yanbo Fu, Wenjie Zhao and Liu Liu

Abstract

Ducted-fan tail-sitter unmanned aerial vehicles (UAVs) provide versatility and unique benefits, attracting significant attention in various applications. This study focuses on developing a safe reinforcement learning method for back-transition control between level flight mode and hover mode for ducted-fan tail-sitter UAVs. Our method enables transition control with a minimal altitude change and transition time while adhering to the velocity constraint. We employ the Trust Region Policy Optimization, Proximal Policy Optimization with Lagrangian, and Constrained Policy Optimization (CPO) algorithms for controller training, showcasing the superiority of the CPO algorithm and the necessity of the velocity constraint. The transition trajectory achieved using the CPO algorithm closely resembles the optimal trajectory obtained via the well-known GPOPS-II software with the SNOPT solver. Meanwhile, the CPO algorithm also exhibits strong robustness under unknown perturbations of UAV model parameters and wind disturbance.
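To make the role of the velocity constraint more concrete, the following is a minimal sketch of the Lagrangian relaxation that constrained policy-gradient methods such as Proximal Policy Optimization with Lagrangian generally rely on. It is not the authors' implementation. The function names, the indicator-style cost, the speed limit, and the learning rate are all hypothetical choices made for illustration; the abstract does not specify any of them.

```python
# Illustrative sketch only: names, cost definition, and constants are
# assumptions, not taken from the paper.

def velocity_cost(speed: float, v_max: float) -> float:
    """Hypothetical per-step cost: 1.0 when the speed limit is exceeded, else 0.0."""
    return 1.0 if speed > v_max else 0.0


def lagrangian_objective(ret: float, cost: float, cost_limit: float, lam: float) -> float:
    """Surrogate the policy maximizes for a fixed multiplier lam:
    expected return minus lam times the constraint violation."""
    return ret - lam * (cost - cost_limit)


def update_multiplier(lam: float, cost: float, cost_limit: float, lr: float = 0.05) -> float:
    """Projected dual-ascent step: lam grows while the measured cost exceeds
    the limit, shrinks otherwise, and is clipped so it never goes negative."""
    return max(0.0, lam + lr * (cost - cost_limit))


if __name__ == "__main__":
    lam = 0.0
    # Hypothetical episode costs (number of steps over the speed limit).
    for episode_cost in [3.0, 2.0, 0.0]:
        lam = update_multiplier(lam, episode_cost, cost_limit=1.0, lr=0.5)
        print(f"episode cost {episode_cost:.1f} -> multiplier {lam:.2f}")
```

In this kind of scheme the multiplier acts as an adaptive penalty weight: it rises while the constraint is violated and decays once the policy stays within the limit. Constrained Policy Optimization handles the constraint differently, inside the trust-region step itself, so this sketch should be read only as background on the Lagrangian variant.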
