REVISTA
Computers

TODAS

Inicio / Computers / Vol: 12 Par: 3 (2023) / Artículo

ARTÍCULO

TITULO

Model Compression for Deep Neural Networks: A Survey

Zhuo Li

Hengyi Li and Lin Meng

Resumen

Currently, with the rapid development of deep learning, deep neural networks (DNNs) have been widely applied in various computer vision tasks. However, in the pursuit of performance, advanced DNN models have become more complex, which has led to a large memory footprint and high computation demands. As a result, the models are difficult to apply in real time. To address these issues, model compression has become a focus of research. Furthermore, model compression techniques play an important role in deploying models on edge devices. This study analyzed various model compression methods to assist researchers in reducing device storage space, speeding up model inference, reducing model complexity and training costs, and improving model deployment. Hence, this paper summarized the state-of-the-art techniques for model compression, including model pruning, parameter quantization, low-rank decomposition, knowledge distillation, and lightweight model design. In addition, this paper discusses research challenges and directions for future work.

Palabras claves

deep neural networks - model compression - model pruning - parameter quantization - low-rank decomposition - knowledge distillation - lightweight model design

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 12 Parte: 3 (2023)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Applied Sciences
Aerospace
Information

DOI

https://doi.org/10.3390/computers12030060

Artículos similares

Experimental Study on the Mechanical Properties of Hybrid Basalt-Polypropylene Fibre-Reinforced Gangue Concrete

Acceso

Yu Yang, Changhao Xin, Yidan Sun, Junzhen Di and Pengfei Liang

Incomplete data indicate that coal gangue is accumulated in China, with over 2000 gangue hills covering an area exceeding 200,000 mu and an annual growth rate surpassing 800 million tons. This accumulation not only signifies a substantial waste of resour... ver más

Revista: Applied Sciences

Integrated 1D Simulation of Aftertreatment System and Chemistry-Based Multizone RCCI Combustion for Optimal Performance with Methane Oxidation Catalyst

Acceso

Alireza Kakoee, Jacek Hunicz and Maciej Mikulski

This paper presents a comprehensive investigation into the design of a methane oxidation catalyst aftertreatment system specifically tailored for the Wärtsilä W31DF natural gas engine which has been converted to a reactivity-controlled compression igniti... ver más

Revista: Journal of Marine Science and Engineering

Stress Prediction Model of Super-High Arch Dams during Their Initial Operation Stages

Acceso

Rongliang Cheng, Xiaofeng Han and Zhiqiang Wu

It is of great significance to identify the spatiotemporal stress distribution characteristics to ensure the safety of a super-high arch dam during the initial operation stage. Taking the 285.5 m-high Xiluodu Dam as an example, the spatiotemporal distrib... ver más

Revista: Water

Sample-Based Gradient Edge and Angular Prediction for VVC Lossless Intra-Coding

Acceso

Guojie Chen and Min Lin

Lossless coding is a compression method in the Versatile Video Coding (VVC) standard, which can compress video without distortion. Lossless coding has great application prospects in fields with high requirements for video quality. Since the current VVC s... ver más

Revista: Applied Sciences

Generally Applicable Q-Table Compression Method and Its Application for Constrained Stochastic Graph Traversal Optimization Problems

Acceso

Tamás Kegyes, Alex Kummer, Zoltán Süle and János Abonyi

We analyzed a special class of graph traversal problems, where the distances are stochastic, and the agent is restricted to take a limited range in one go. We showed that both constrained shortest Hamiltonian pathfinding problems and disassembly line bal... ver más

Revista: Information

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas