Redirigiendo al acceso original de articulo en 24 segundos...
Inicio  /  Applied Sciences  /  Vol: 12 Par: 10 (2022)  /  Artículo
ARTÍCULO
TITULO

The Impact of Ensemble Techniques on Software Maintenance Change Prediction: An Empirical Study

Hadeel Alsolai and Marc Roper    

Resumen

Various prediction models have been proposed by researchers to predict the change-proneness of classes based on source code metrics. However, some of these models suffer from low prediction accuracy because datasets exhibit high dimensionality or imbalanced classes. Recent studies suggest that using ensembles to integrate several models, select features, or perform sampling has the potential to resolve issues in the datasets and improve the prediction accuracy. This study aims to empirically evaluate the effectiveness of the ensemble models, feature selection, and sampling techniques on predicting change-proneness using different metrics. We conduct an empirical study to compare the performance of four machine learning models (naive Bayes, support vector machines, k-nearest neighbors, and random forests) on seven datasets for predicting change-proneness. We use two types of feature selection (relief and Pearson?s correlation coefficient) and three types of ensemble sampling techniques, which integrate different types of sampling techniques (SMOTE, spread sub-sample, and randomize). The results of this study reveal that the ensemble feature selection and sampling techniques yield improved prediction accuracy over most of the investigated models, and using sampling techniques increased the prediction accuracy of all models. Random forests provide a significant improvement over other prediction models and obtained the highest value of the average of the area under curve in all scenarios. The proposed ensemble feature selection and sampling techniques, along with the ensemble model (random forests), were found beneficial in improving the prediction accuracy of change-proneness.

 Artículos similares

       
 
Mashael Aldayel, Amira Kharrat and Abeer Al-Nafjan    
Individual choices and preferences are important factors that impact decision making. Artificial intelligence can predict decisions by objectively detecting individual choices and preferences using natural language processing, computer vision, and machin... ver más
Revista: Applied Sciences

 
Temidayo Oluwatosin Omotehinwa and David Opeoluwa Oyewola    
Unsolicited emails, popularly referred to as spam, have remained one of the biggest threats to cybersecurity globally. More than half of the emails sent in 2021 were spam, resulting in huge financial losses. The tenacity and perpetual presence of the adv... ver más
Revista: Applied Sciences

 
Amna Al-Sayed, Mashael M. Khayyat and Nuha Zamzami    
Different data types are frequently included in clinical data. Applying machine learning algorithms to mixed data can be difficult and impact the output accuracy and quality. This paper proposes a hybrid model of unsupervised and supervised learning tech... ver más
Revista: Applied Sciences

 
Mohamad Abou Ali, Fadi Dornaika and Ignacio Arganda-Carreras    
Artificial intelligence (AI) has emerged as a cutting-edge tool, simultaneously accelerating, securing, and enhancing the diagnosis and treatment of patients. An exemplification of this capability is evident in the analysis of peripheral blood smears (PB... ver más
Revista: Algorithms

 
Yilin Yang, Qiuming Cheng, Jin-Yeu Tsou, Ka-Po Wong, Yanzhuo Men and Yuanzhi Zhang    
Under the influence of global warming, the problem of sea-level rise is becoming increasingly prominent. The northern part of the South China Sea (SCS) is low lying, with intense economic development, and densely populated. These characteristics make the... ver más