JOURNAL
Algorithms

ARTICLE

TITLE

Machine Learning Model for Multiomics Biomarkers Identification for Menopause Status in Breast Cancer

Firas Alghanim

Ibrahim Al-Hurani

Hazem Qattous

Abdullah Al-Refai

Osamah Batiha

Abedalrhman Alkhateeb and Salama Ikki

Abstract

Identifying menopause-related breast cancer biomarkers is crucial for enhancing diagnosis, prognosis, and personalized treatment at that stage of the patient's life. In this paper, we present a comprehensive framework for extracting multiomics biomarkers specifically related to breast cancer incidence before and after menopause. Our approach integrates DNA methylation, gene expression, and copy number alteration data using a systematic pipeline that encompasses data preprocessing, class-imbalance handling, dimensionality reduction, and classification. The framework starts with MutSigCV for data preprocessing and quality assurance. The Synthetic Minority Over-sampling Technique (SMOTE) is then applied to address the class imbalance. Next, Principal Component Analysis (PCA) transforms the DNA methylation, gene expression, and copy number alteration data into a latent space, with the aim of discarding irrelevant variation and retaining relevant information. Finally, a classification model is built on the transformed multiomics data, combined into a unified representation. The framework contributes to understanding the complex interplay between menopause and breast cancer, thereby pointing toward more precise diagnostic and therapeutic strategies in the future. An explainable artificial intelligence analysis using Shapley values computed on an XGBoost regressor showed the power of the selected gene expressions for predicting menopause status; the potential biomarkers included RUNX1, PTEN, MAP3K1, and CDH1. These findings are supported by the literature.
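
The class-imbalance step named in the abstract can be illustrated with a minimal sketch of the basic SMOTE idea: each synthetic sample is placed at a random point on the segment between a minority-class sample and one of its k nearest minority-class neighbours. This is not the authors' implementation. The function names `smote` and `balance`, the toy two-dimensional data, and the labels "pre" and "post" are all hypothetical and chosen only for illustration.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (basic SMOTE idea)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def balance(X, y, minority_label, k=5, seed=0):
    """Over-sample one minority class until it matches the size of the rest."""
    minority = [x for x, label in zip(X, y) if label == minority_label]
    n_needed = (len(y) - len(minority)) - len(minority)
    new_points = smote(minority, max(n_needed, 0), k=k, seed=seed)
    return X + new_points, y + [minority_label] * len(new_points)


# Hypothetical toy data: six "post" samples and three "pre" samples.
X = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5], [0.2, 0.8],
     [5.0, 5.0], [6.0, 5.0], [5.0, 6.0]]
y = ["post"] * 6 + ["pre"] * 3

X_bal, y_bal = balance(X, y, minority_label="pre", k=2)
print(y_bal.count("pre"), y_bal.count("post"))
```

In a pipeline like the one described, over-sampling would normally be applied to the training split only, so that synthetic samples do not leak into the evaluation data; whether the authors did so is not stated in the abstract.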

Keywords

multiomics data integration - machine learning - breast cancer - menopause - classification - explainable AI

ISSUE

Volume: 17 Issue: 1 (2024)

SUBJECTS

CIVIL ENGINEERING AND CONSTRUCTION
TECHNOLOGY

DOI

https://doi.org/10.3390/a17010013
