Redirigiendo al acceso original de articulo en 21 segundos...
ARTÍCULO
TITULO

Analyzing Geographic Questions Using Embedding-based Topic Modeling

Jonghyeon Yang    
Hanme Jang and Kiyun Yu    

Resumen

Recently, open-domain question-answering systems have achieved tremendous progress because of developments in large language models (LLMs), and have successfully been applied to question-answering (QA) systems, or Chatbots. However, there has been little progress in open-domain question answering in the geographic domain. Existing open-domain question-answering research in the geographic domain relies heavily on rule-based semantic parsing approaches using few data. To develop intelligent GeoQA agents, it is crucial to build QA systems upon datasets that reflect the real users? needs regarding the geographic domain. Existing studies have analyzed geographic questions using the geographic question corpora Microsoft MAchine Reading Comprehension (MS MARCO), comprising real-world user queries from Bing in terms of structural similarity, which does not discover the users? interests. Therefore, we aimed to analyze location-related questions in MS MARCO based on semantic similarity, group similar questions into a cluster, and utilize the results to discover the users? interests in the geographic domain. Using a sentence-embedding-based topic modeling approach to cluster semantically similar questions, we successfully obtained topic models that could gather semantically similar documents into a single cluster. Furthermore, we successfully discovered latent topics within a large collection of questions to guide practical GeoQA systems on relevant questions.

Palabras claves

 Artículos similares

       
 
Shuang Lu, Jianyun Huang and Jing Wu    
In the contexts of global climate change and the urbanization process, urban flooding poses significant challenges worldwide, necessitating effective rapid assessments to understand its impacts on various aspects of urban systems. This can be achieved th... ver más
Revista: Water

 
Aldo Fiori, Irene Pomarico, Antonio Zarlenga, Vittorio Catani and Guido Leone    
This work extends the overlay and index methods for intrinsic groundwater vulnerability, that typically involve the soil surface and the vadose zone, to groundwater (saturated) transport. The method is ?hybrid? as it combines the standard overlay and ind... ver más
Revista: Water

 
Yanjie Sun, Mingguang Wu, Xiaoyan Liu and Liangchen Zhou    
High-precision dynamic traffic noise maps can describe the spatial and temporal distributions of noise and are necessary for actual noise prevention. Existing monitoring point-based methods suffer from limited spatial adaptability, and prediction model-b... ver más

 
Xuehua Han and Juanle Wang    
Public behavior in cyberspace is extremely sensitive to emergency disaster events. Using appropriate methodologies to capture the semantic evolution of social media users? behaviors and discover how it varies across geographic space and time still presen... ver más

 
Chuan Yin, Binyu Zhang, Wanzeng Liu, Mingyi Du, Nana Luo, Xi Zhai and Tu Ba    
Expansion of the entity attribute information of geographic knowledge graphs is essentially the fusion of the Internet?s encyclopedic knowledge. However, it lacks structured attribute information, and synonymy and polysemy always exist. These reduce the ... ver más