ARTICLE
TITLE

Efficiency of Extreme Gradient Boosting for Imbalanced Land Cover Classification Using an Extended Margin and Disagreement Performance

Fei Sun    
Run Wang    
Bo Wan    
Yanjun Su    
Qinghua Guo    
Youxin Huang and Xincai Wu    

Abstract

Imbalanced learning is a methodological challenge in the remote sensing community, especially in complex areas where land cover types are spectrally similar. Obtaining high-confidence classification results from imbalanced classes is highly important in practice. In this paper, extreme gradient boosting (XGB), a novel tree-based ensemble system, is employed to classify land cover types in very-high-resolution (VHR) images with imbalanced training data. We introduce an extended margin criterion and disagreement performance to evaluate the efficiency of XGB in imbalanced learning situations and examine the effect of minority class spectral separability on model performance. The results suggest that the uncertainty of XGB associated with correct classification is stable. The average probability-based margin of correct classification provided by XGB is 0.82, which is about 46.30% higher than that of the random forest (RF) method (0.56). Moreover, the performance uncertainty of XGB is insensitive to spectral separability once the sample imbalance reaches a certain level (minority:majority > 10:100). The impact of sample imbalance on the minority class is also related to its spectral separability, and XGB performs better than RF in terms of user accuracy for a minority class with imperfect separability. The disagreement components of XGB are better and more stable than those of RF with imbalanced samples, especially for complex areas with more land cover types. In addition, an appropriate degree of sample imbalance helps to improve the trade-off between the recognition accuracy of XGB and the sample cost. According to our analysis, the margin-based uncertainty assessment and disagreement performance can help users identify the confidence level and error components of classifications whose performance (overall, producer, and user accuracies) is otherwise similar.
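
To illustrate the kind of quantity the abstract reports, the following is a minimal sketch of a probability-based margin. It assumes the common ensemble-margin definition (the predicted probability of the true class minus the largest probability among the other classes); the paper's extended margin criterion may differ in detail. The function names and the sample probabilities are hypothetical and are not taken from the paper.

```python
# Sketch of a probability-based margin, under an assumed definition:
# margin = p(true class) - max p(any other class).
# Function names and data below are illustrative, not from the paper.

def probability_margin(probs, true_label):
    """Margin of one sample given its per-class probabilities."""
    p_true = probs[true_label]
    p_other = max(p for k, p in enumerate(probs) if k != true_label)
    return p_true - p_other


def mean_margin_of_correct(prob_rows, labels):
    """Average margin over correctly classified samples (margin > 0).

    A margin of exactly 0 is a tie and is not counted as correct here.
    """
    margins = [probability_margin(row, y) for row, y in zip(prob_rows, labels)]
    correct = [m for m in margins if m > 0]
    return sum(correct) / len(correct) if correct else float("nan")


# Hypothetical class probabilities for three samples and three classes.
example_probs = [
    [0.90, 0.05, 0.05],  # true class 0, confidently correct
    [0.20, 0.70, 0.10],  # true class 1, correct
    [0.60, 0.30, 0.10],  # true class 1, misclassified (negative margin)
]
example_labels = [0, 1, 1]

print(mean_margin_of_correct(example_probs, example_labels))
```

Under this definition a margin close to 1 indicates a confident correct decision, a margin close to 0 indicates a decision made with little confidence, and a negative margin indicates a misclassification. Averaging only the positive margins gives a figure comparable in spirit to the 0.82 and 0.56 values quoted above.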
