JOURNAL
Future Internet

ARTICLE

TITLE

A Classifier to Detect Informational vs. Non-Informational Heart Attack Tweets

Ola Karajeh

Dirar Darweesh

Omar Darwish

Noor Abu-El-Rub

Belal Alsinglawi and Nasser Alsaedi

Abstract

Social media sites are considered one of the most important sources of data in many fields, such as health, education, and politics. While surveys provide explicit answers to specific questions, social media posts contain the same answers implicitly in their text. This research aims to develop a method for extracting implicit answers from large tweet collections, and to demonstrate this method on an important concern: heart attacks. The approach is to collect tweets containing "heart attack" and then select from those the ones with useful information. Informational tweets are those which express real heart attack issues, e.g., "Yesterday morning, my grandfather had a heart attack while he was walking around the garden." Non-informational tweets, on the other hand, include ones such as "Dropped my iPhone for the first time and almost had a heart attack." The starting point was to manually classify around 7000 tweets as either informational (11%) or non-informational (89%), yielding a labeled dataset for devising a machine learning classifier that can be applied to our large collection of over 20 million tweets. Tweets were cleaned and converted to a vector representation suitable for different machine learning algorithms: deep neural networks, support vector machine (SVM), J48 decision tree, and naïve Bayes. Our experimentation aimed to find the best algorithm for building a high-quality classifier. This involved splitting the labeled dataset, with 2/3 used to train the classifier and 1/3 used for evaluation, in addition to cross-validation. The deep neural network (DNN) classifier obtained the highest accuracy (95.2%). It also obtained the highest F1-scores, 73.6% and 97.4% for the informational and non-informational classes, respectively.
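The workflow described in the abstract (clean tweets, convert them to a vector representation, train on 2/3 of the labeled data, evaluate on the remaining 1/3 with per-class F1) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify their tooling or vectorization, so the sketch assumes a plain bag-of-words count representation and a hand-written multinomial naïve Bayes, one of the four algorithm families mentioned. All names (`tokenize`, `NaiveBayes`, `f1_per_class`) are hypothetical, and the tweets are invented for illustration except the two quoted from the abstract.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Rough cleaning step (an assumption): lowercase, keep letter runs only."""
    return re.findall(r"[a-z']+", text.lower())

class NaiveBayes:
    """Multinomial naive Bayes over word counts with add-one smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.class_counts}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for c in sorted(self.class_counts):
            score = math.log(self.class_counts[c] / total_docs)  # class prior
            denom = sum(self.word_counts[c].values()) + len(self.vocab)
            for w in tokenize(text):
                if w in self.vocab:  # words unseen in training are skipped
                    score += math.log((self.word_counts[c][w] + 1) / denom)
            if score > best_score:
                best_label, best_score = c, score
        return best_label

def f1_per_class(y_true, y_pred, label):
    """F1 for one class: harmonic mean of that class's precision and recall."""
    tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
    fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

INFO, NON = "informational", "non-informational"

# Toy labeled data; the first two tweets are quoted from the abstract,
# the rest are hypothetical examples.
data = [
    ("Yesterday morning, my grandfather had a heart attack while he was "
     "walking around the garden.", INFO),
    ("Dropped my iPhone for the first time and almost had a heart attack.", NON),
    ("My uncle was taken to hospital after a heart attack last night.", INFO),
    ("That plot twist almost gave me a heart attack lol", NON),
    ("Doctors said my father survived his heart attack thanks to fast "
     "treatment.", INFO),
    ("Almost had a heart attack when my phone fell in the pool", NON),
    ("My grandfather is in hospital after a heart attack", INFO),
    ("This game almost gave me a heart attack lol", NON),
    ("My phone bill almost gave me a heart attack", NON),
]

# 2/3 of the labeled tweets for training, 1/3 held out for evaluation.
n_train = len(data) * 2 // 3
train, test = data[:n_train], data[n_train:]

model = NaiveBayes().fit([t for t, _ in train], [y for _, y in train])
y_true = [y for _, y in test]
y_pred = [model.predict(t) for t, _ in test]

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print("accuracy:", accuracy)
for label in (INFO, NON):
    print(label, "F1:", f1_per_class(y_true, y_pred, label))
```

Reporting F1 per class rather than accuracy alone matters for data as imbalanced as the 11%/89% split described above, since a classifier that always answers "non-informational" would already score high accuracy while achieving an F1 of zero on the informational class.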

Keywords

machine learning - classification - support vector machine - deep neural networks - tweets - heart attack - health

ISSUE

Volume: 13 Issue: 1 (2021)

SUBJECTS

INFRASTRUCTURE

SIMILAR JOURNALS

Future Internet
IoT
Big Data and Cognitive Computing

DOI

https://doi.org/10.3390/fi13010019

Similar articles

Panic Detection Using Machine Learning and Real-Time Biometric and Spatiotemporal Data

Ilias Lazarou, Anastasios L. Kesidis, George Hloupis and Andreas Tsatsaris

It is common sense that immediate response and action are among the most important terms when it comes to public safety, and emergency response systems (ERS) are technology components strictly tied to this purpose. While the use of ERSs is increasingly a...

Journal: ISPRS International Journal of Geo-Information

Machine Learning-Based Lie Detector Applied to a Novel Annotated Game Dataset

Nuria Rodriguez-Diaz, Decky Aspandi, Federico M. Sukno and Xavier Binefa

Lie detection is considered a concern for everyone in their day-to-day life, given its impact on human interactions. Thus, people normally pay attention to both what their interlocutors are saying and to their visual appearance, including the face, to fi...

Journal: Future Internet

Detection of Malicious Websites Using Symbolic Classifier

Nikola Andelic, Sandi Baressi Šegota, Ivan Lorencin and Matko Glucina

Malicious websites are web locations that attempt to install malware, which is the general term for anything that will cause problems in computer operation, gather confidential information, or gain total control over the computer. In this paper, a novel ...

Journal: Future Internet

Using Machine Learning for Web Page Classification in Search Engine Optimization

Goran Matoševic, Jasminka Dobša and Dunja Mladenic

This paper presents a novel approach of using machine learning algorithms based on experts' knowledge to classify web pages into three predefined classes according to the degree of content adjustment to the search engine optimization (SEO) recommendation...

Journal: Future Internet

Machine Learning Reveals a Significant Shift in Water Regime Types Due to Projected Climate Change

Georgy Ayzel

A water regime type is a cumulative representation of seasonal runoff variability in a textual, qualitative, or quantitative form developed for a particular period. The assessment of the respective water regime type changes is of high importance for loca...

Journal: ISPRS International Journal of Geo-Information
