Information, Vol. 13, No. 1 (2022)
ARTICLE
TITLE

Impact on Inference Model Performance for ML Tasks Using Real-Life Training Data and Synthetic Training Data from GANs

Ulrike Faltings, Tobias Bettinger, Swen Barth and Michael Schäfer

Abstract

Collecting and labeling good, balanced training data is usually difficult under real conditions. In addition to classic modeling methods, Generative Adversarial Networks (GANs) offer a powerful way to generate synthetic training data. In this paper, we evaluate the hybrid use of real-life and generated synthetic training data in different fractions and its effect on model performance. We found that using up to 75% synthetic training data can compensate for time-consuming and costly manual annotation, while model performance in our Deep Learning (DL) use case stays in the same range as with a 100% share of hand-annotated real images. With synthetic training data specifically tailored to produce a balanced dataset, special care can be taken of events that occur only rarely, and ML models can be deployed in industry without much delay, making them feasible and economically attractive for a wide range of applications in the process and manufacturing industries. Hence, the main outcome of this paper is that our methodology can help advance the implementation of many different industrial Machine Learning and Computer Vision applications by making them economically maintainable. We conclude that many industrial ML use cases that require large, balanced training data containing all information relevant to the target model can be solved in the future by following the findings presented in this study.
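The idea of combining real and synthetic training data in different fractions can be illustrated with a minimal sketch. This is not the authors' pipeline: the function `mix_training_set`, its parameters, and the placeholder pools `real_pool` and `synthetic_pool` are hypothetical names introduced here, and the samples are dummy tuples standing in for hand-annotated images and GAN-generated images.

```python
import random


def mix_training_set(real, synthetic, synthetic_fraction, total, seed=0):
    """Draw `total` samples, of which round(total * synthetic_fraction)
    come from the synthetic pool and the rest from the real pool."""
    if not 0.0 <= synthetic_fraction <= 1.0:
        raise ValueError("synthetic_fraction must be in [0, 1]")
    n_synthetic = round(total * synthetic_fraction)
    n_real = total - n_synthetic
    if n_synthetic > len(synthetic) or n_real > len(real):
        raise ValueError("not enough samples in one of the pools")
    rng = random.Random(seed)  # fixed seed so the mix is reproducible
    mixed = rng.sample(real, n_real) + rng.sample(synthetic, n_synthetic)
    rng.shuffle(mixed)  # avoid all real samples preceding synthetic ones
    return mixed


# Hypothetical placeholder pools: (source, index) tuples instead of images.
real_pool = [("real", i) for i in range(100)]
synthetic_pool = [("synthetic", i) for i in range(300)]

# Build training sets of equal size with increasing synthetic share.
for fraction in (0.0, 0.25, 0.5, 0.75):
    train = mix_training_set(real_pool, synthetic_pool, fraction, total=100)
    n_syn = sum(1 for source, _ in train if source == "synthetic")
    print(f"fraction {fraction:.2f}: {n_syn} synthetic, {len(train) - n_syn} real")
```

Keeping the total size fixed while varying only the synthetic share is one way to make model performance comparable across fractions; whether the study held the total constant is an assumption of this sketch.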

Similar articles

       
 
Somayeh Shahrabadi, Telmo Adão, Emanuel Peres, Raul Morais, Luís G. Magalhães and Victor Alves    
The proliferation of classification-capable artificial intelligence (AI) across a wide range of domains (e.g., agriculture, construction, etc.) has made it possible to optimize and complement several tasks typically performed by humans. The computatio...
Journal: Algorithms

 
Liang Liu, Tianbin Li and Chunchi Ma    
Three-dimensional (3D) models provide the most intuitive representation of geological conditions. Traditional modeling methods depend heavily on technicians' expertise and are not easy to update. In this study, we introduce a deep learning-based method fo...
Journal: Applied Sciences

 
Kang Cao, Yongjie Zhang and Jianfei Feng    
As aviation technology advances, numerous new aircraft enter the market. These not only offer airlines technological and fuel efficiency advantages but also present the challenge of how to conduct pilots' aircraft-type transition training efficiently and...
Journal: Aerospace

 
David Naseh, Mahdi Abdollahpour and Daniele Tarchi    
This paper explores the practical implementation and performance analysis of distributed learning (DL) frameworks on various client platforms, responding to the dynamic landscape of 6G technology and the pressing need for a fully connected distributed in...
Journal: Information

 
Zeyu Xu, Wenbin Yu, Chengjun Zhang and Yadang Chen    
In the era of noisy intermediate-scale quantum (NISQ) computing, the synergistic collaboration between quantum and classical computing models has emerged as a promising solution for tackling complex computational challenges. Long short-term memory (LSTM)...
Journal: Information