Redirigiendo al acceso original de articulo en 24 segundos...
Inicio  /  Algorithms  /  Vol: 13 Par: 9 (2020)  /  Artículo
ARTÍCULO
TITULO

Feasibility Analysis and Application of Reinforcement Learning Algorithm Based on Dynamic Parameter Adjustment

Menglin Li    
Xueqiang Gu    
Chengyi Zeng and Yuan Feng    

Resumen

Reinforcement learning, as a branch of machine learning, has been gradually applied in the control field. However, in the practical application of the algorithm, the hyperparametric approach to network settings for deep reinforcement learning still follows the empirical attempts of traditional machine learning (supervised learning and unsupervised learning). This method ignores part of the information generated by agents exploring the environment contained in the updating of the reinforcement learning value function, which will affect the performance of the convergence and cumulative return of reinforcement learning. The reinforcement learning algorithm based on dynamic parameter adjustment is a new method for setting learning rate parameters of deep reinforcement learning. Based on the traditional method of setting parameters for reinforcement learning, this method analyzes the advantages of different learning rates at different stages of reinforcement learning and dynamically adjusts the learning rates in combination with the temporal-difference (TD) error values to achieve the advantages of different learning rates in different stages to improve the rationality of the algorithm in practical application. At the same time, by combining the Robbins?Monro approximation algorithm and deep reinforcement learning algorithm, it is proved that the algorithm of dynamic regulation learning rate can theoretically meet the convergence requirements of the intelligent control algorithm. In the experiment, the effect of this method is analyzed through the continuous control scenario in the standard experimental environment of ?Car-on-The-Hill? of reinforcement learning, and it is verified that the new method can achieve better results than the traditional reinforcement learning in practical application. According to the model characteristics of the deep reinforcement learning, a more suitable setting method for the learning rate of the deep reinforcement learning network proposed. At the same time, the feasibility of the method has been proved both in theory and in the application. Therefore, the method of setting the learning rate parameter is worthy of further development and research.

 Artículos similares

       
 
Young-Cheol Kim, Dong-Hyeop Kim and Sang-Woo Kim    
To achieve the commercialization of electric vertical takeoff and landing (eVTOL) aircrafts, which have recently garnered attention as the next-generation means of transportation, objective certification based on rigorous procedures is essential. With th... ver más
Revista: Aerospace

 
Russell Shomberg, Michael Jakuba and Dana Yoerger    
We propose a design for a float capable of harvesting wave energy while fully submerged. The proposed design could theoretically operate indefinitely without ever breaching the surface. We developed and validated design guidelines for the proposed float ... ver más

 
Qirui Bo, Junwei Liu, Wenchang Shang, Ankit Garg, Xiaoru Jia and Kaiyue Sun    
Nowadays, the use of new compound chemical stabilizers to treat marine clay has gained significant attention. However, the complex non-linear relationship between the influencing factors and the unconfined compressive strength of chemically treated marin... ver más

 
Weidong Zhao, Bernt Johan Leira, Knut Vilhelm Høyland, Ekaterina Kim, Guoqing Feng and Huilong Ren    
This paper presents a framework for structural analysis of icebreakers during ramming of first-year ice ridges. The framework links the ice-ridge load and the structural analysis based on the physical characteristics of ship?ice-ridge interactions. A shi... ver más

 
Liming Li and Zeang Zhao    
To effectively enhance the adaptability of earthquake rescue robots in dynamic environments and complex tasks, there is an urgent need for an evaluation method that quantifies their performance and facilitates the selection of rescue robots with optimal ... ver más
Revista: Applied Sciences