Redirigiendo al acceso original de articulo en 18 segundos...
Inicio  /  Applied Sciences  /  Vol: 12 Par: 6 (2022)  /  Artículo
ARTÍCULO
TITULO

Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics

Honghu Xue    
Benedikt Hein    
Mohamed Bakr    
Georg Schildbach    
Bengt Abel and Elmar Rueckert    

Resumen

We propose a deep reinforcement learning approach for solving a mapless navigation problem in warehouse scenarios. In our approach, an automatic guided vehicle is equipped with two LiDAR sensors and one frontal RGB camera and learns to perform a targeted navigation task. The challenges reside in the sparseness of positive samples for learning, multi-modal sensor perception with partial observability, the demand for accurate steering maneuvers together with long training cycles. To address these points, we propose NavACL-Q as an automatic curriculum learning method in combination with a distributed version of the soft actor-critic algorithm. The performance of the learning algorithm is evaluated exhaustively in a different warehouse environment to validate both robustness and generalizability of the learned policy. Results in NVIDIA Isaac Sim demonstrates that our trained agent significantly outperforms the map-based navigation pipeline provided by NVIDIA Isaac Sim with an increased agent-goal distance of 3 m and a wider initial relative agent-goal rotation of approximately 45° 45 ° . The ablation studies also suggest that NavACL-Q greatly facilitates the whole learning process with a performance gain of roughly 40% 40 % compared to training with random starts and a pre-trained feature extractor manifestly boosts the performance by approximately 60% 60 % .

 Artículos similares

       
 
Jose M. Bernal-de-Lázaro     Pág. 74 - 81
This article summarizes the main contributions of the PhD thesis titled: "Application of learning techniques based on kernel methods for the fault diagnosis in Industrial processes". This thesis focuses on the analysis and design of fault diagnosis syste... ver más

 
Hugo López-Fernández     Pág. 22 - 25
Mass spectrometry using matrix assisted laser desorption ionization coupled to time of flight analyzers (MALDI-TOF MS) has become popular during the last decade due to its high speed, sensitivity and robustness for detecting proteins and peptides. This a... ver más

 
Jiahao Chen, Jiaxin Li, Deqian Zheng, Qianru Zheng, Jiayi Zhang, Meimei Wu and Chaosai Liu    
The multi-field coupling of grain piles in grain silos is a focal point of research in the field of grain storage. The porosity of grain piles is a critical parameter that affects the heat and moisture transfer in grain piles. To investigate the distribu... ver más
Revista: Applied Sciences

 
Futo Ueda, Hiroto Tanouchi, Nobuyuki Egusa and Takuya Yoshihiro    
River water-level prediction is crucial for mitigating flood damage caused by torrential rainfall. In this paper, we attempt to predict river water levels using a deep learning model based on radar rainfall data instead of data from upstream hydrological... ver más
Revista: Water

 
Saikat Das, Mohammad Ashrafuzzaman, Frederick T. Sheldon and Sajjan Shiva    
The distributed denial of service (DDoS) attack is one of the most pernicious threats in cyberspace. Catastrophic failures over the past two decades have resulted in catastrophic and costly disruption of services across all sectors and critical infrastru... ver más
Revista: Algorithms