Algorithms, Vol. 15, Issue 4 (2022)
ARTICLE
TITLE

Trinity: Neural Network Adaptive Distributed Parallel Training Method Based on Reinforcement Learning

Yan Zeng, Jiyang Wu, Jilin Zhang, Yongjian Ren and Yunquan Zhang

Abstract

Deep learning, with increasingly large datasets and complex neural networks, is widely used in computer vision and natural language processing. A resulting trend is to split large-scale neural network models across multiple devices and train them in parallel, known as parallel model training. Existing parallel methods are mainly based on expert design, which is inefficient and requires specialized knowledge. Although automated parallel methods have been proposed to solve these problems, they optimize only a single aspect, run time. In this paper, we present Trinity, an adaptive distributed parallel training method based on reinforcement learning, to automate the search and tuning of parallel strategies. We build a multidimensional performance evaluation model and use proximal policy optimization to co-optimize multiple aspects of performance. Our experiments used the CIFAR10 and PTB datasets with the InceptionV3, NMT, NASNet and PNASNet models. Compared with Google's Hierarchical method, Trinity achieves up to 5% reductions in runtime, communication, and memory overhead, and up to a 40% increase in parallel strategy search speed.
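
The abstract does not spell out the evaluation model, so the following is only a rough sketch of how several performance aspects might be folded into a single reward for a policy-gradient learner such as proximal policy optimization. The `Measurement` fields, the `scalar_reward` function, the baseline normalization and the weights are all illustrative assumptions, not Trinity's actual formulation.

```python
from dataclasses import dataclass


@dataclass
class Measurement:
    """Metrics observed for one candidate parallel strategy (hypothetical fields)."""
    runtime_s: float
    comm_bytes: float
    peak_mem_bytes: float


def scalar_reward(m, baseline, weights=(0.5, 0.25, 0.25)):
    """Combine three costs into one reward; higher is better.

    Each cost is divided by a baseline measurement so the terms are unitless.
    The weights are assumed for illustration, not taken from the paper.
    """
    w_run, w_comm, w_mem = weights
    cost = (
        w_run * m.runtime_s / baseline.runtime_s
        + w_comm * m.comm_bytes / baseline.comm_bytes
        + w_mem * m.peak_mem_bytes / baseline.peak_mem_bytes
    )
    return -cost


# A strategy that halves run time at equal communication and memory cost
# should score higher than the baseline strategy itself.
baseline = Measurement(runtime_s=2.0, comm_bytes=4e9, peak_mem_bytes=8e9)
candidate = Measurement(runtime_s=1.0, comm_bytes=4e9, peak_mem_bytes=8e9)
```

A weighted sum is the simplest way to trade the three costs against each other; a reinforcement learning agent would receive this value after each candidate strategy is measured.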
