We Know You Are Living in Bali: Location Prediction of Twitter Users Using BERT Language Model

Lihardo Faisal Simanjuntak

Rahmad Mahendra and Evi Yulianti

Resumen

Twitter user location data provide essential information that can be used for various purposes. However, user location is not easy to identify because many profiles omit this information, or users enter data that do not correspond to their actual locations. Several related works attempted to predict location on English-language tweets. In this study, we attempted to predict the location of Indonesian tweets. We utilized machine learning approaches, i.e., long-short term memory (LSTM) and bidirectional encoder representations from transformers (BERT) to infer Twitter users? home locations using display name in profile, user description, and user tweets. By concatenating display name, description, and aggregated tweet, the model achieved the best accuracy of 0.77. The performance of the IndoBERT model outperformed several baseline models.

Palabras claves

Twitter - location - prediction - BERT - Indonesian

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 6 Parte: 3 (2022)

MATERIAS

INFRAESTRUCTURA

REVISTAS SIMILARES

ISPRS International Journal of Geo-Information
Informed Infrastructure
Transportation Research Procedia

DOI