Inicio  /  Applied Sciences  /  Vol: 12 Par: 6 (2022)  /  Artículo
ARTÍCULO
TITULO

Using Feature Selection with Machine Learning for Generation of Insurance Insights

Ayman Taha    
Bernard Cosgrave and Susan Mckeever    

Resumen

Insurance is a data-rich sector, hosting large volumes of customer data that is analysed to evaluate risk. Machine learning techniques are increasingly used in the effective management of insurance risk. Insurance datasets by their nature, however, are often of poor quality with noisy subsets of data (or features). Choosing the right features of data is a significant pre-processing step in the creation of machine learning models. The inclusion of irrelevant and redundant features has been demonstrated to affect the performance of learning models. In this article, we propose a framework for improving predictive machine learning techniques in the insurance sector via the selection of relevant features. The experimental results, based on five publicly available real insurance datasets, show the importance of applying feature selection for the removal of noisy features before performing machine learning techniques, to allow the algorithm to focus on influential features. An additional business benefit is the revelation of the most and least important features in the datasets. These insights can prove useful for decision making and strategy development in areas/business problems that are not limited to the direct target of the downstream algorithms. In our experiments, machine learning techniques based on a set of selected features suggested by feature selection algorithms outperformed the full feature set for a set of real insurance datasets. Specifically, 20% and 50% of features in our five datasets had improved downstream clustering and classification performance when compared to whole datasets. This indicates the potential for feature selection in the insurance sector to both improve model performance and to highlight influential features for business insights.

 Artículos similares

       
 
Zijia Zheng, Yizhu Jiang, Qiutong Zhang, Yanling Zhong and Lizheng Wang    
The timely monitoring of urban water bodies using unmanned aerial vehicle (UAV)-mounted remote sensing technology is crucial for urban water resource protection and management. Addressing the limitations of the use of satellite data in inferring the wate... ver más
Revista: Water

 
Zhongda Ren, Chuanjie Liu, Yafei Ou, Peng Zhang, Heshan Fan, Xiaolong Zhao, Heqin Cheng, Lizhi Teng, Ming Tang and Fengnian Zhou    
Effectively simulating the variation in suspended sediment concentration (SSC) in estuaries during typhoons is significant for the water quality and ecological conditions of estuarine shoal wetlands and their adjacent coastal waters. During typhoons, SSC... ver más
Revista: Water

 
Iqbal Muhammad Zubair, Yung-Seop Lee and Byunghoon Kim    
The selection of group features is a critical aspect in reducing model complexity by choosing the most essential group features, while eliminating the less significant ones. The existing group feature selection methods select a set of important group fea... ver más
Revista: Applied Sciences

 
Guanwen Zhang and Dongnian Jiang    
Rolling bearings are one of the most important and indispensable components of a mechanical system, and an accurate prediction of their remaining life is essential to ensuring the reliable operation of a mechanical system. In order to effectively utilize... ver más
Revista: Applied Sciences

 
Fan Lin, Dengjie Chen, Cheng Liu and Jincheng He    
This study pioneered a non-destructive testing approach to evaluating the physicochemical properties of golden passion fruit by developing a platform to analyze the fruit?s electrical characteristics. By using dielectric properties, the method accurately... ver más
Revista: Applied Sciences