Inicio  /  Applied Sciences  /  Vol: 9 Par: 20 (2019)  /  Artículo
ARTÍCULO
TITULO

Mutual Information Input Selector and Probabilistic Machine Learning Utilisation for Air Pollution Proxies

Martha A. Zaidan    
Lubna Dada    
Mansour A. Alghamdi    
Hisham Al-Jeelani    
Heikki Lihavainen    
Antti Hyvärinen and Tareq Hussein    

Resumen

An air pollutant proxy is a mathematical model that estimates an unobserved air pollutant using other measured variables. The proxy is advantageous to fill missing data in a research campaign or to substitute a real measurement for minimising the cost as well as the operators involved (i.e., virtual sensor). In this paper, we present a generic concept of pollutant proxy development based on an optimised data-driven approach. We propose a mutual information concept to determine the interdependence of different variables and thus select the most correlated inputs. The most relevant variables are selected to be the best proxy inputs, where several metrics and data loss are also involved for guidance. The input selection method determines the used data for training pollutant proxies based on a probabilistic machine learning method. In particular, we use a Bayesian neural network that naturally prevents overfitting and provides confidence intervals around its output prediction. In this way, the prediction uncertainty could be assessed and evaluated. In order to demonstrate the effectiveness of our approach, we test it on an extensive air pollution database to estimate ozone concentration.

 Artículos similares

       
 
Junartho Halomoan, Kalamullah Ramli, Dodi Sudiana, Teddy Surya Gunawan and Muhammad Salman    
One of the WHO?s strategies to reduce road traffic injuries and fatalities is to enhance vehicle safety. Driving fatigue detection can be used to increase vehicle safety. Our previous study developed an ECG-based driving fatigue detection framework with ... ver más
Revista: Information

 
Anika Strittmatter, Anna Caroli and Frank G. Zöllner    
Multimodal image registration is an important component of medical image processing, allowing the integration of complementary information from various imaging modalities to improve clinical applications like diagnosis and treatment planning. We proposed... ver más
Revista: Applied Sciences

 
Siyi Zhou, Kewei Cai, Yanhong Feng, Xiaomeng Tang, Hongshuai Pang, Jiaqi He and Xiang Shi    
In aquaculture, the accurate recognition of fish underwater has outstanding academic value and economic benefits for scientifically guiding aquaculture production, which assists in the analysis of aquaculture programs and studies of fish behavior. Howeve... ver más

 
Shuang Wang, Amin Beheshti, Yufei Wang, Jianchao Lu, Quan Z. Sheng, Stephen Elbourn and Hamid Alinejad-Rokny    
Instructors face significant time and effort constraints when grading students? assessments on a large scale. Clustering similar assessments is a unique and effective technique that has the potential to significantly reduce the workload of instructors in... ver más
Revista: Algorithms

 
Deepesh Chugh, Himanshu Mittal, Amit Saxena, Ritu Chauhan, Eiad Yafi and Mukesh Prasad    
Determining the optimal feature set is a challenging problem, especially in an unsupervised domain. To mitigate the same, this paper presents a new unsupervised feature selection method, termed as densest feature graph augmentation with disjoint feature ... ver más
Revista: Algorithms