Inicio  /  Applied Sciences  /  Vol: 12 Par: 6 (2022)  /  Artículo
ARTÍCULO
TITULO

Deep Deterministic Policy Gradient with Reward Function Based on Fuzzy Logic for Robotic Peg-in-Hole Assembly Tasks

Ziyue Wang    
Fengming Li    
Yu Men    
Tianyu Fu    
Xuting Yang and Rui Song    

Resumen

Robot automatic assembly of weak stiffness parts is difficult due to potential deformation during assembly. The robot manipulation cannot adapt to the dynamic contact changes during the assembly process. A robot assembly skill learning system is designed by combining the compliance control and deep reinforcement, which could acquire a better robot assembly strategy. In this paper, a robot assembly strategy learning method based on variable impedance control is proposed to solve the robot assembly contact tasks. During the assembly process, the quality evaluation is designed based on fuzzy logic, and the impedance parameters in the assembly process are studied with a deep deterministic policy gradient. Finally, the effectiveness of the method is verified using the KUKA iiwa robot in the weak stiffness peg-in-hole assembly. Experimental results show that the robot obtains the robot assembly strategy with variable compliant in the process of weak stiffness peg-in-hole assembly. Compared with the previous methods, the assembly success rate of the proposed method reaches 100%.

 Artículos similares

       
 
Jianya Yuan, Mengxue Han, Hongjian Wang, Bo Zhong, Wei Gao and Dan Yu    
Collision avoidance planning has always been a hot and important issue in the field of unmanned aircraft research. In this article, we describe an online collision avoidance planning algorithm for autonomous underwater vehicle (AUV) autonomous navigation... ver más

 
Xi Lyu, Yushan Sun, Lifeng Wang, Jiehui Tan and Liwen Zhang    
This study aims to solve the problems of sparse reward, single policy, and poor environmental adaptability in the local motion planning task of autonomous underwater vehicles (AUVs). We propose a two-layer deep deterministic policy gradient algorithm-bas... ver más

 
Hao Chen, Chuanqiang Gao, Jifei Wu, Kai Ren and Weiwei Zhang    
Transonic buffet is a phenomenon of large self-excited shock oscillations caused by shock wave-boundary layer interaction, which is one of the common flow instability problems in aeronautical engineering. This phenomenon involves unsteady flow, which mak... ver más
Revista: Aerospace

 
Wanli Li, Jiong Li, Ningbo Li, Lei Shao and Mingjie Li    
Concerned with the problem of interceptor midcourse guidance trajectory online planning satisfying multiple constraints, an online midcourse guidance trajectory planning method based on deep reinforcement learning (DRL) is proposed. The Markov decision p... ver más
Revista: Aerospace

 
Jiqing Du, Dan Zhou, Wei Wang and Sachiyo Arai    
The Deep Reinforcement Learning (DRL) algorithm is an optimal control method with generalization capacity for complex nonlinear coupled systems. However, the DRL agent maintains control command saturation and response overshoot to achieve the fastest res... ver más