Information, Vol. 12, Issue 8 (2021)

Geometric Regularization of Local Activations for Knowledge Transfer in Convolutional Neural Networks

Ilias Theodorakopoulos, Foteini Fotopoulou and George Economou

Abstract

In this work, we propose a mechanism for knowledge transfer between Convolutional Neural Networks (CNNs) via the geometric regularization of local features produced by the activations of convolutional layers. We formulate loss functions that drive a "student" model to adapt so that its local features exhibit geometrical characteristics similar to those of an "instructor" model at corresponding layers. The investigated functions, inspired by manifold-to-manifold distance measures, are designed to compare the neighboring information inside the feature space of the involved activations without any restriction on the features' dimensionality, thus enabling knowledge transfer between different architectures. Experimental evidence demonstrates that the proposed technique is effective in different settings, including knowledge transfer to smaller models, transfer between different deep architectures, and harnessing knowledge from external data, producing models with higher accuracy than typical training. Furthermore, results indicate that the presented method can work synergistically with methods such as knowledge distillation, further increasing the accuracy of the trained models. Finally, experiments on training with limited data show that a combined regularization scheme can achieve the same generalization as non-regularized training with 50% of the data in the CIFAR-10 classification task.
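
The idea of comparing neighboring information without constraining feature dimensionality can be illustrated with a minimal NumPy sketch. This is not the loss formulated in the paper; it is a simplified stand-in under stated assumptions: both activations share the same spatial size, neighborhoods are summarized by a row-normalized affinity matrix over pairwise squared distances, and the penalty is the mean squared difference between the two affinity matrices. The function names and the `temperature` parameter are hypothetical.

```python
import numpy as np


def local_features(activation):
    """Reshape a (C, H, W) activation map into H*W local features of size C."""
    channels, height, width = activation.shape
    return activation.reshape(channels, height * width).T


def neighborhood_affinity(features, temperature=1.0):
    """Row-normalized affinities built from pairwise squared distances.

    Assumption: a softmax over negative distances stands in for the
    neighborhood structure; the paper's actual measures may differ.
    """
    sq_norms = np.sum(features ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * features @ features.T
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against tiny negative values
    # Divide by the mean distance so the feature scale does not dominate.
    scale = sq_dists.mean() + 1e-12
    logits = -sq_dists / (temperature * scale)
    np.fill_diagonal(logits, -np.inf)  # a feature is not its own neighbor
    logits -= logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)
    return weights / weights.sum(axis=1, keepdims=True)


def geometric_loss(student_act, instructor_act):
    """Mean squared difference between the two neighborhood structures.

    Both affinity matrices are (H*W, H*W), so the channel counts of the
    student and instructor activations are free to differ.
    """
    student_aff = neighborhood_affinity(local_features(student_act))
    instructor_aff = neighborhood_affinity(local_features(instructor_act))
    return float(np.mean((student_aff - instructor_aff) ** 2))


rng = np.random.default_rng(0)
instructor = rng.normal(size=(64, 8, 8))  # hypothetical wide instructor layer
student = rng.normal(size=(16, 8, 8))     # hypothetical narrow student layer
loss = geometric_loss(student, instructor)
```

Because the comparison happens between two matrices indexed by spatial position, a 16-channel student layer can be regularized against a 64-channel instructor layer without any projection between the two feature spaces. In training, a term of this kind would be added to the task loss with a weighting factor and computed in a framework that supports automatic differentiation.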
