Inicio  /  Applied Sciences  /  Vol: 13 Par: 4 (2023)  /  Artículo
ARTÍCULO
TITULO

Tabular Data Generation to Improve Classification of Liver Disease Diagnosis

Mohammad Alauthman    
Amjad Aldweesh    
Ahmad Al-qerem    
Faisal Aburub    
Yazan Al-Smadi    
Awad M. Abaker    
Omar Radhi Alzubi and Bilal Alzubi    

Resumen

Liver diseases are among the most common diseases worldwide. Because of the high incidence and high mortality rate, these diseases diagnoses are vital. Several elements harm the liver. For instance, obesity, undiagnosed hepatitis infection, and alcohol abuse. This causes abnormal nerve function, bloody coughing or vomiting, insufficient kidney function, hepatic failure, jaundice, and liver encephalopathy.. The diagnosis of this disease is very expensive and complex. Therefore, this work aims to assess the performance of various machine learning algorithms at decreasing the cost of predictive diagnoses of chronic liver disease. In this study, five machine learning algorithms were employed: Logistic Regression, K-Nearest Neighbor, Decision Tree, Support Vector Machine, and Artificial Neural Network (ANN) algorithm. In this work, we examined the effects of the increased prediction accuracy of Generative Adversarial Networks (GANs) and the synthetic minority oversampling technique (SMOTE). Generative opponents? networks (GANs) are a mechanism to produce artificial data with a distribution close to real data distribution. This is achieved by training two different networks: the generator, which seeks to produce new and real samples, and the discriminator, which classifies the augmented samples using supervised classifications. Statistics show that the use of increased data slightly improves the performance of the classifier.

 Artículos similares

       
 
Leon Kopitar, Iztok Fister, Jr. and Gregor Stiglic    
Introduction: Type 2 diabetes mellitus is a major global health concern, but interpreting machine learning models for diagnosis remains challenging. This study investigates combining association rule mining with advanced natural language processing to im... ver más
Revista: Information

 
Samuel de Oliveira, Oguzhan Topsakal and Onur Toker    
Automated Machine Learning (AutoML) is a subdomain of machine learning that seeks to expand the usability of traditional machine learning methods to non-expert users by automating various tasks which normally require manual configuration. Prior benchmark... ver más
Revista: Information

 
Jinhong Wu, Konstantinos Plataniotis, Lucy Liu, Ehsan Amjadian and Yuri Lawryshyn    
Synthetic data, artificially generated by computer programs, has become more widely used in the financial domain to mitigate privacy concerns. Variational Autoencoder (VAE) is one of the most popular deep-learning models for generating synthetic data. Ho... ver más
Revista: Algorithms

 
Bradley Walters, Sandra Ortega-Martorell, Ivan Olier and Paulo J. G. Lisboa    
A lack of transparency in machine learning models can limit their application. We show that analysis of variance (ANOVA) methods extract interpretable predictive models from them. This is possible because ANOVA decompositions represent multivariate funct... ver más
Revista: Algorithms

 
Robert Aufschläger, Jakob Folz, Elena März, Johann Guggumos, Michael Heigl, Benedikt Buchner and Martin Schramm    
Revista: Information