REVISTA
Aerospace

TODAS

Inicio / Aerospace / Vol: 7 Par: 10 (2020) / Artículo

ARTÍCULO

TITULO

Natural Language Processing Based Method for Clustering and Analysis of Aviation Safety Narratives

Rodrigo L. Rose

Tejas G. Puranik and Dimitri N. Mavris

Resumen

The complexity of commercial aviation operations has grown substantially in recent years, together with a diversification of techniques for collecting and analyzing flight data. As a result, data-driven frameworks for enhancing flight safety have grown in popularity. Data-driven techniques offer efficient and repeatable exploration of patterns and anomalies in large datasets. Text-based flight safety data presents a unique challenge in its subjectivity, and relies on natural language processing tools to extract underlying trends from narratives. In this paper, a methodology is presented for the analysis of aviation safety narratives based on text-based accounts of in-flight events and categorical metadata parameters which accompany them. An extensive pre-processing routine is presented, including a comparison between numeric models of textual representation for the purposes of document classification. A framework for categorizing and visualizing narratives is presented through a combination of k-means clustering and 2-D mapping with t-Distributed Stochastic Neighbor Embedding (t-SNE). A cluster post-processing routine is developed for identifying driving factors in each cluster and building a hierarchical structure of cluster and sub-cluster labels. The Aviation Safety Reporting System (ASRS), which includes over a million de-identified voluntarily submitted reports describing aviation safety incidents for commercial flights, is analyzed as a case study for the methodology. The method results in the identification of 10 major clusters and a total of 31 sub-clusters. The identified groupings are post-processed through metadata-based statistical analysis of the learned clusters. The developed method shows promise in uncovering trends from clusters that are not evident in existing anomaly labels in the data and offers a new tool for obtaining insights from text-based safety data that complement existing approaches.

Palabras claves

aviation - risk - clustering - text mining - aviation safety reporting system

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 7 Parte: 10 (2020)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Applied Sciences
Algorithms
Informatics

DOI

https://doi.org/10.3390/aerospace7100143

Artículos similares

Enhancing Product Design through AI-Driven Sentiment Analysis of Amazon Reviews Using BERT

Acceso

Mahammad Khalid Shaik Vadla, Mahima Agumbe Suresh and Vimal K. Viswanathan

Understanding customer emotions and preferences is paramount for success in the dynamic product design landscape. This paper presents a study to develop a prediction pipeline to detect the aspect and perform sentiment analysis on review data. The pre-tra... ver más

Revista: Algorithms

Aiding ICD-10 Encoding of Clinical Health Records Using Improved Text Cosine Similarity and PLM-ICD

Acceso

Hugo Silva, Vítor Duque, Mário Macedo and Mateus Mendes

The International Classification of Diseases, 10th edition (ICD-10), has been widely used for the classification of patient diagnostic information. This classification is usually performed by dedicated physicians with specific coding training, and it is ... ver más

Revista: Algorithms

The Research Interest in ChatGPT and Other Natural Language Processing Tools from a Public Health Perspective: A Bibliometric Analysis

Acceso

Giuliana Favara, Martina Barchitta, Andrea Maugeri, Roberta Magnano San Lio and Antonella Agodi

Background: Natural language processing, such as ChatGPT, demonstrates growing potential across numerous research scenarios, also raising interest in its applications in public health and epidemiology. Here, we applied a bibliometric analysis for a syste... ver más

Revista: Informatics

A Comprehensive Review of AI Techniques for Addressing Algorithmic Bias in Job Hiring

Acceso

Elham Albaroudi, Taha Mansouri and Ali Alameer

The study comprehensively reviews artificial intelligence (AI) techniques for addressing algorithmic bias in job hiring. More businesses are using AI in curriculum vitae (CV) screening. While the move improves efficiency in the recruitment process, it is... ver más

Revista: AI

Clustering-Based Joint Topic-Sentiment Modeling of Social Media Data: A Neural Networks Approach

Acceso

David Hanny and Bernd Resch

With the vast amount of social media posts available online, topic modeling and sentiment analysis have become central methods to better understand and analyze online behavior and opinion. However, semantic and sentiment analysis have rarely been combine... ver más

Revista: Information

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas