ARTÍCULO
TITULO

ChineseCTRE: A Model for Geographical Named Entity Recognition and Correction Based on Deep Neural Networks and the BERT Model

Wei Zhang    
Jingtao Meng    
Jianhua Wan    
Chengkun Zhang    
Jiajun Zhang    
Yuanyuan Wang    
Liuchang Xu and Fei Li    

Resumen

Social media is widely used to share real-time information and report accidents during natural disasters. Named entity recognition (NER) is a fundamental task of geospatial information applications that aims to extract location names from natural language text. As a result, the identification of location names from social media information has gradually become a demand. Named entity correction (NEC), as a complementary task of NER, plays a crucial role in ensuring the accuracy of location names and further improving the accuracy of NER. Despite numerous methods having been adopted for NER, including text statistics-based and deep learning-based methods, there has been limited research on NEC. To address this gap, we propose the CTRE model, which is a geospatial named entity recognition and correction model based on the BERT model framework. Our approach enhances the BERT model by introducing incremental pre-training in the pre-training phase, significantly improving the model?s recognition accuracy. Subsequently, we adopt the pre-training fine-tuning mode of the BERT base model and extend the fine-tuning process, incorporating a neural network framework to construct the geospatial named entity recognition model and geospatial named entity correction model, respectively. The BERT model utilizes data augmentation of VGI (volunteered geographic information) data and social media data for incremental pre-training, leading to an enhancement in the model accuracy from 85% to 87%. The F1 score of the geospatial named entity recognition model reaches an impressive 0.9045, while the precision of the geospatial named entity correction model achieves 0.9765. The experimental results robustly demonstrate the effectiveness of our proposed CTRE model, providing a reference for subsequent research on location names.

 Artículos similares

       
 
Futo Ueda, Hiroto Tanouchi, Nobuyuki Egusa and Takuya Yoshihiro    
River water-level prediction is crucial for mitigating flood damage caused by torrential rainfall. In this paper, we attempt to predict river water levels using a deep learning model based on radar rainfall data instead of data from upstream hydrological... ver más
Revista: Water

 
Chenhao Wu, Longgang Xiang, Libiao Chen, Qingcen Zhong and Xiongwei Wu    
With the development of location-based services and data collection equipment, the volume of trajectory data has been growing at a phenomenal rate. Raw trajectory data come in the form of sequences of ?coordinate-time-attribute? triplets, which require c... ver más

 
Xiaojin Huang, Renzhong Guo, Xiaoming Li, Minmin Li, Yong Fan and Yaxing Li    
Understanding the economic impact of COVID-19 is the foundation for formulating targeted policies promoting economic recovery. This study uses panel data of the county economy in the Guangdong?Hong Kong?Macao Greater Bay Area (GBA) from 2017 to 2022. Fir... ver más

 
Ce Liang, Jun Zhu, Jinbin Zhang, Qing Zhu, Jingyi Lu, Jianbo Lai and Jianlin Wu    
It is essential to establish a digital twin scene, which helps to depict the dynamically changing geographical environment accurately. Digital twins could improve the refined management level of intelligent tunnel construction; however, research on geogr... ver más

 
Baohua Wei and Lei Zhu    
Bike sharing offers a usable form of feeder transportation for connecting to public transportation and effectively meets unmet travel demands, alleviating the pressure on public transportation systems by diverting urban commuters. To advance the comprehe... ver más