Inicio  /  Applied Sciences  /  Vol: 13 Par: 21 (2023)  /  Artículo
ARTÍCULO
TITULO

A Study on Dropout Prediction for University Students Using Machine Learning

Choong Hee Cho    
Yang Woo Yu and Hyeon Gyu Kim    

Resumen

Student dropout is a serious issue in that it not only affects the individual students who drop out but also has negative impacts on the former university, family, and society together. To resolve this, various attempts have been made to predict student dropout using machine learning. This paper presents a model to predict student dropout at Sahmyook University using machine learning. Academic records collected from 20,050 students of the university were analyzed and used for learning. Various machine learning algorithms were used to implement the model, including Logistic Regression, Decision Tree, Random Forest, Support Vector Machine, Deep Neural Network, and LightGBM (Light Gradient Boosting Machine), and their performances were compared through experiments. We also discuss the influence of oversampling used to resolve data imbalance issues in the dropout data. For this purpose, various oversampling algorithms such as SMOTE, ADASYN, and Borderline-SMOTE were tested. Our experimental results showed that the proposed model implemented using LightGBM provided the best performance with an F1-score of 0.840, which is higher than the results of previous studies discussing the dropout prediction with the issue of class imbalance.

 Artículos similares

       
 
Roseline Oluwaseun Ogundokun, Rytis Maskeliunas, Sanjay Misra and Robertas Damasevicius    
Posture detection targets toward providing assessments for the monitoring of the health and welfare of humans have been of great interest to researchers from different disciplines. The use of computer vision systems for posture recognition might result i... ver más
Revista: Algorithms

 
Andres Gonzalez-Nucamendi, Julieta Noguez, Luis Neri, Víctor Robledo-Rella, Rosa María Guadalupe García-Castelán and David Escobar-Castillejos    
With the recent advancements of learning analytics techniques, it is possible to build predictive models of student academic performance at an early stage of a course, using student?s self-regulation learning and affective strategies (SRLAS), and their m... ver más
Revista: Applied Sciences

 
Tran Thanh Ngoc, Le Van Dai, Lam Binh Minh     Pág. 258 - 269
This study investigates data standardization methods based on the grid search (GS) algorithm for energy load forecasting, including zero-mean, min-max, max, decimal, sigmoid, softmax, median, and robust, to determine the hyperparameters of deep learning ... ver más

 
Hisashi Hayashi, Yasuyuki Okazaki, Daisuke Sakai, Shingo Morimoto and Masato Shinji    
In tunnel construction in gravel-mixed ground, it is extremely important to predict the risk of gravel dropout and the accompanying large deformation of the substrate ground. The distribution and shape of gravel vary, and it is difficult to reproduce the... ver más
Revista: Applied Sciences

 
Mengyuan Wang, Jiatao Gan, Changfeng Han, Yanbing Guo, Kaihao Chen, Ya-zhou Shi and Ben-gong Zhang    
More and more researchers use single-cell RNA sequencing (scRNA-seq) technology to characterize the transcriptional map at the single-cell level. They use it to study the heterogeneity of complex tissues, transcriptome dynamics, and the diversity of unkn... ver más
Revista: Applied Sciences