Inicio  /  Informatics  /  Vol: 9 Par: 1 (2022)  /  Artículo
ARTÍCULO
TITULO

Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing

Negar Orangi-Fard    
Alireza Akhbardeh and Hersh Sagreiya    

Resumen

Predicting ICU readmission risk will help physicians make decisions regarding discharge. We used discharge summaries to predict ICU 30-day readmission risk using text mining and machine learning (ML) with data from the Medical Information Mart for Intensive Care III (MIMIC-III). We used Natural Language Processing (NLP) and the Bag-of-Words approach on discharge summaries to build a Document-Term-Matrix with 3000 features. We compared the performance of support vector machines with the radial basis function kernel (SVM-RBF), adaptive boosting (AdaBoost), quadratic discriminant analysis (QDA), least absolute shrinkage and selection operator (LASSO), and Ridge Regression. A total of 4000 patients were used for model training and 6000 were used for validation. Using the bag-of-words determined by NLP, the area under the receiver operating characteristic (AUROC) curve was 0.71, 0.68, 0.65, 0.69, and 0.65 correspondingly for SVM-RBF, AdaBoost, QDA, LASSO, and Ridge Regression. We then used the SVM-RBF model for feature selection by incrementally adding features to the model from 1 to 3000 bag-of-words. Through this exhaustive search approach, only 825 features (words) were dominant. Using those selected features, we trained and validated all ML models. The AUROC curve was 0.74, 0.69, 0.67, 0.70, and 0.71 respectively for SVM-RBF, AdaBoost, QDA, LASSO, and Ridge Regression. Overall, this technique could predict ICU readmission relatively well.

 Artículos similares

       
 
Junling Zhang, Min Mei, Jun Wang, Guangpeng Shang, Xuefeng Hu, Jing Yan and Qian Fang    
The deformation of tunnel support structures during tunnel construction is influenced by geological factors, geometrical factors, support factors, and construction factors. Accurate prediction of tunnel support structure deformation is crucial for engine... ver más
Revista: Applied Sciences

 
Yongyong Zhao, Jinghua Wang, Guohua Cao and Xu Yao    
This study introduces a reduced-order leg dynamic model to simplify the controller design and enhance robustness. The proposed multi-loop control scheme tackles tracking control issues in legged robots, including joint angle and contact-force regulation,... ver más
Revista: Applied Sciences

 
Lilai Jin, Sarah J. Higgins, James A. Thompson, Michael P. Strager, Sean E. Collins and Jason A. Hubbart    
Saturated hydraulic conductivity (Ksat) is a hydrologic flux parameter commonly used to determine water movement through the saturated soil zone. Understanding the influences of land-use-specific Ksat on the model estimation error of water balance compon... ver más
Revista: Water

 
Sofía Ramos-Pulido, Neil Hernández-Gress and Gabriela Torres-Delgado    
Current research on the career satisfaction of graduates limits educational institutions in devising methods to attain high career satisfaction. Thus, this study aims to use data science models to understand and predict career satisfaction based on infor... ver más
Revista: Informatics

 
Wenhao Li, Xianxia Zhang, Yueying Wang and Songbo Xie    
Model predictive control (MPC), an extensively developed rolling optimization control method, is widely utilized in the industrial field. While some researchers have incorporated predictive control into underactuated unmanned surface vehicles (USVs), mos... ver más