ARTICLE
TITLE

A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution

Ana Bárbara Cardoso, Bruno Martins and Jacinto Estima

Abstract

This article describes a novel approach for toponym resolution with deep neural networks. Rather than matching references in the text against entries in a gazetteer, the proposed approach directly predicts geo-spatial coordinates. The neural network architecture takes multiple inputs (e.g., the toponym is considered together with its surrounding words for disambiguation) and combines pre-trained contextual word embeddings (i.e., ELMo or BERT) with bi-directional Long Short-Term Memory units, both of which are regularly used for modeling textual data. The intermediate representations are then used to predict a probability distribution over possible geo-spatial regions and, finally, the coordinates of the input toponym. The proposed model was tested on three datasets used in previous toponym resolution studies, specifically the (i) War of the Rebellion, (ii) Local-Global Lexicon, and (iii) SpatialML corpora. We also evaluated the effect of using (i) geophysical terrain properties as external information, including elevation and terrain development, among others, and (ii) additional data collected from Wikipedia articles to further help with training the model. The results show improvements over previous approaches, particularly when BERT embeddings and the additional data are used.
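
The last stage described above, going from a probability distribution over regions to a single pair of coordinates, can be illustrated with a minimal sketch. The abstract does not say how the regions are defined or how the distribution is turned into coordinates, so everything here is an assumption made for illustration: the function names, the three example region centroids, the probability-weighted spherical mean used for aggregation, and the haversine distance used to measure the error.

```python
import math

def softmax(logits):
    """Turn raw region scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def to_unit_vector(lat_deg, lon_deg):
    """Convert latitude/longitude in degrees to a 3D unit vector."""
    lat, lon = math.radians(lat_deg), math.radians(lon_deg)
    return (math.cos(lat) * math.cos(lon),
            math.cos(lat) * math.sin(lon),
            math.sin(lat))

def expected_coordinates(probs, centroids):
    """Probability-weighted mean of region centroids, computed on the sphere.

    Averaging unit vectors (instead of raw degrees) avoids artifacts near
    the antimeridian. This aggregation rule is an assumption of the sketch.
    """
    x = y = z = 0.0
    for p, (lat, lon) in zip(probs, centroids):
        vx, vy, vz = to_unit_vector(lat, lon)
        x += p * vx
        y += p * vy
        z += p * vz
    lat = math.degrees(math.atan2(z, math.hypot(x, y)))
    lon = math.degrees(math.atan2(y, x))
    return lat, lon

def haversine_km(a, b, radius_km=6371.0):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(min(1.0, h)))

# Hypothetical region centroids (lat, lon) and hypothetical region scores.
centroids = [(38.72, -9.14), (51.51, -0.13), (40.71, -74.01)]
probs = softmax([4.0, 1.0, 0.5])
predicted = expected_coordinates(probs, centroids)
error_km = haversine_km(predicted, centroids[0])
```

In this sketch the prediction is pulled toward the most probable region but is not restricted to a centroid, which is one way a model of this kind could output coordinates without a gazetteer lookup.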
