Inicio  /  Applied Sciences  /  Vol: 10 Par: 23 (2020)  /  Artículo
ARTÍCULO
TITULO

Decomposition Methods for Machine Learning with Small, Incomplete or Noisy Datasets

Cesar Federico Caiafa    
Jordi Solé-Casals    
Pere Marti-Puig    
Sun Zhe and Toshihisa Tanaka    

Resumen

In many machine learning applications, measurements are sometimes incomplete or noisy resulting in missing features. In other cases, and for different reasons, the datasets are originally small, and therefore, more data samples are required to derive useful supervised or unsupervised classification methods. Correct handling of incomplete, noisy or small datasets in machine learning is a fundamental and classic challenge. In this article, we provide a unified review of recently proposed methods based on signal decomposition for missing features imputation (data completion), classification of noisy samples and artificial generation of new data samples (data augmentation). We illustrate the application of these signal decomposition methods in diverse selected practical machine learning examples including: brain computer interface, epileptic intracranial electroencephalogram signals classification, face recognition/verification and water networks data analysis. We show that a signal decomposition approach can provide valuable tools to improve machine learning performance with low quality datasets.

 Artículos similares

       
 
Haoyu Lin, Pengkun Quan, Zhuo Liang, Dongbo Wei and Shichun Di    
With the rise of electric vehicles, autonomous driving, and valet parking technologies, considerable research has been dedicated to automatic charging solutions. While the current focus lies on charging robot design and the visual positioning of charging... ver más
Revista: Applied Sciences

 
Zitong Wang, Enrang Zheng, Jianguo Liu and Tuo Guo    
Traditional methods of orthogonal basis function decomposition have been extensively used to detect magnetic anomaly signals. However, the determination of the relative velocity between the detection platform and the magnetic target remains elusive in pr... ver más
Revista: Applied Sciences

 
Mahbuba Begum, Sumaita Binte Shorif, Mohammad Shorif Uddin, Jannatul Ferdush, Tony Jan, Alistair Barros and Md Whaiduzzaman    
Digital multimedia elements such as text, image, audio, and video can be easily manipulated because of the rapid rise of multimedia technology, making data protection a prime concern. Hence, copyright protection, content authentication, and integrity ver... ver más
Revista: Algorithms

 
Fiza Zafar, Alicia Cordero, Husna Maryam and Juan R. Torregrosa    
Power flow problems can be solved in a variety of ways by using the Newton?Raphson approach. The nonlinear power flow equations depend upon voltages Vi" role="presentation">|????|Vi V i and phase angle δ" role="presentation">??d d . An electri... ver más
Revista: Algorithms

 
Jose Rodrigo, Luis Sanchez de Leon, Jose L. Montañes and Jose M. Vega    
A very fast reduced order model is developed to monitor aeroengines condition (defining their degradation from a baseline state) in real time, by using synthetic data collected in specific sensors. This reduced model is constructed by applying higher-ord... ver más
Revista: Aerospace