Redirigiendo al acceso original de articulo en 23 segundos...
ARTÍCULO
TITULO

Analyzing Geographic Questions Using Embedding-based Topic Modeling

Jonghyeon Yang    
Hanme Jang and Kiyun Yu    

Resumen

Recently, open-domain question-answering systems have achieved tremendous progress because of developments in large language models (LLMs), and have successfully been applied to question-answering (QA) systems, or Chatbots. However, there has been little progress in open-domain question answering in the geographic domain. Existing open-domain question-answering research in the geographic domain relies heavily on rule-based semantic parsing approaches using few data. To develop intelligent GeoQA agents, it is crucial to build QA systems upon datasets that reflect the real users? needs regarding the geographic domain. Existing studies have analyzed geographic questions using the geographic question corpora Microsoft MAchine Reading Comprehension (MS MARCO), comprising real-world user queries from Bing in terms of structural similarity, which does not discover the users? interests. Therefore, we aimed to analyze location-related questions in MS MARCO based on semantic similarity, group similar questions into a cluster, and utilize the results to discover the users? interests in the geographic domain. Using a sentence-embedding-based topic modeling approach to cluster semantically similar questions, we successfully obtained topic models that could gather semantically similar documents into a single cluster. Furthermore, we successfully discovered latent topics within a large collection of questions to guide practical GeoQA systems on relevant questions.

Palabras claves

 Artículos similares

       
 
Aldo Fiori, Irene Pomarico, Antonio Zarlenga, Vittorio Catani and Guido Leone    
This work extends the overlay and index methods for intrinsic groundwater vulnerability, that typically involve the soil surface and the vadose zone, to groundwater (saturated) transport. The method is ?hybrid? as it combines the standard overlay and ind... ver más
Revista: Water

 
Khaled Abuhasel    
This study compares the environmental sustainability of two cities in Saudi Arabia, Abha, and Bisha, through their green spaces, by analyzing green spaces in both cities. And the application of spatial statistics tools in the Arc Map program, to measure ... ver más

 
Yanjie Sun, Mingguang Wu, Xiaoyan Liu and Liangchen Zhou    
High-precision dynamic traffic noise maps can describe the spatial and temporal distributions of noise and are necessary for actual noise prevention. Existing monitoring point-based methods suffer from limited spatial adaptability, and prediction model-b... ver más

 
Montserrat Delpino-Chamy and Yolanda Pérez Albert    
(1) Background: To assess the quality of the built environment, it is necessary to study both the physical components and the inhabitants? perceptions. However, since objective indicators are easily measurable, most studies have centered only on analyzin... ver más
Revista: Urban Science

 
Xuehua Han and Juanle Wang    
Public behavior in cyberspace is extremely sensitive to emergency disaster events. Using appropriate methodologies to capture the semantic evolution of social media users? behaviors and discover how it varies across geographic space and time still presen... ver más