Layer-Wise Compressive Training for Convolutional Neural Networks

Matteo Grimaldi, Valerio Tenace and Andrea Calimera

Abstract

Convolutional Neural Networks (CNNs) are brain-inspired computational models designed to recognize patterns. Recent advances demonstrate that CNNs can match, and often exceed, human capabilities in many application domains. With several million parameters, even the simplest CNN has a large model size. This is a serious concern for deployment on resource-constrained embedded systems, where compression stages are needed to meet stringent hardware constraints. In this paper, we introduce a novel accuracy-driven compressive training algorithm. It consists of a two-stage flow: first, layers are sorted by means of heuristic rules according to their significance; second, a modified stochastic gradient descent optimization is applied to the less significant layers so that their representation collapses into a constrained subspace. Experimental results demonstrate that our approach achieves remarkable compression rates with low accuracy loss (<1%).
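The two-stage flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the heuristic rules or the constrained subspace, so the significance scores, the ternary set {-scale, 0, +scale}, and the names `rank_layers`, `project_ternary`, and `constrained_sgd_step` are all assumptions made purely for illustration.

```python
from typing import Dict, List


def rank_layers(significance: Dict[str, float]) -> List[str]:
    """Stage 1 (assumed form): order layer names from least to most significant.

    The significance scores are hypothetical inputs; the abstract only says
    that layers are sorted by heuristic rules.
    """
    return sorted(significance, key=lambda name: significance[name])


def project_ternary(weights: List[float], scale: float, threshold: float) -> List[float]:
    """Map each weight to -scale, 0, or +scale by thresholding its value.

    A ternary set is only an assumed example of a constrained subspace.
    """
    projected = []
    for w in weights:
        if w > threshold:
            projected.append(scale)
        elif w < -threshold:
            projected.append(-scale)
        else:
            projected.append(0.0)
    return projected


def constrained_sgd_step(
    weights: List[float],
    grads: List[float],
    lr: float,
    scale: float,
    threshold: float,
) -> List[float]:
    """Stage 2 (assumed form): a plain SGD update followed by a projection."""
    updated = [w - lr * g for w, g in zip(weights, grads)]
    return project_ternary(updated, scale, threshold)


# Hypothetical scores: the least significant layer is compressed first.
order = rank_layers({"conv1": 0.9, "conv2": 0.4, "fc1": 0.1})
new_weights = constrained_sgd_step(
    weights=[0.8, -0.05, -0.6],
    grads=[1.0, 0.5, -1.0],
    lr=0.1,
    scale=0.5,
    threshold=0.25,
)
print(order)        # ['fc1', 'conv2', 'conv1']
print(new_weights)  # [0.5, 0.0, -0.5]
```

In an accuracy-driven loop of this kind, one would presumably walk through `order` and stop constraining further layers once the measured accuracy loss exceeds a chosen budget; that stopping rule is likewise an assumption here, not a detail given in the abstract.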

Keywords

deep learning - machine learning - neural networks on-chip - optimization

Issue

Volume: 11, Part: 1 (2019)

DOI

https://doi.org/10.3390/fi11010007
