ARTÍCULO
TITULO

Implementing a Timing Error-Resilient and Energy-Efficient Near-Threshold Hardware Accelerator for Deep Neural Network Inference

Noel Daniel Gundi    
Pramesh Pandey    
Sanghamitra Roy and Koushik Chakraborty    

Resumen

Increasing processing requirements in the Artificial Intelligence (AI) realm has led to the emergence of domain-specific architectures for Deep Neural Network (DNN) applications. Tensor Processing Unit (TPU), a DNN accelerator by Google, has emerged as a front runner outclassing its contemporaries, CPUs and GPUs, in performance by 15×?30×. TPUs have been deployed in Google data centers to cater to the performance demands. However, a TPU?s performance enhancement is accompanied by a mammoth power consumption. In the pursuit of lowering the energy utilization, this paper proposes PREDITOR?a low-power TPU operating in the Near-Threshold Computing (NTC) realm. PREDITOR uses mathematical analysis to mitigate the undetectable timing errors by boosting the voltage of the selective multiplier-and-accumulator units at specific intervals to enhance the performance of the NTC TPU, thereby ensuring a high inference accuracy at low voltage. PREDITOR offers up to 3×?5× improved performance in comparison to the leading-edge error mitigation schemes with a minor loss in accuracy.

 Artículos similares

       
 
Lieschen Venter, Stephan Visagie     Pág. 64 - 84
In this paper the assignment of cross-trained and temporary workers to tasks on an assembly line is investigated. Cross-trained workers are skilled to perform more than one task on the assembly line in the production process. Temporary workers are... ver más