ARTÍCULO
TITULO

Is ChatGPT a Good Geospatial Data Analyst? Exploring the Integration of Natural Language into Structured Query Language within a Spatial Database

Yongyao Jiang and Chaowei Yang    

Resumen

With recent advancements, large language models (LLMs) such as ChatGPT and Bard have shown the potential to disrupt many industries, from customer service to healthcare. Traditionally, humans interact with geospatial data through software (e.g., ArcGIS 10.3) and programming languages (e.g., Python). As a pioneer study, we explore the possibility of using an LLM as an interface to interact with geospatial datasets through natural language. To achieve this, we also propose a framework to (1) train an LLM to understand the datasets, (2) generate geospatial SQL queries based on a natural language question, (3) send the SQL query to the backend database, (4) parse the database response back to human language. As a proof of concept, a case study was conducted on real-world data to evaluate its performance on various queries. The results show that LLMs can be accurate in generating SQL code for most cases, including spatial joins, although there is still room for improvement. As all geospatial data can be stored in a spatial database, we hope that this framework can serve as a proxy to improve the efficiency of spatial data analyses and unlock the possibility of automated geospatial analytics.

 Artículos similares

       
 
Peng Zhang and Maged N. Kamel Boulos    
Generative AI (artificial intelligence) refers to algorithms and models, such as OpenAI?s ChatGPT, that can be prompted to generate various types of content. In this narrative review, we present a selection of representative examples of generative AI app... ver más
Revista: Future Internet

 
Konstantinos I. Roumeliotis and Nikolaos D. Tselikas    
According to numerous reports, ChatGPT represents a significant breakthrough in the field of artificial intelligence. ChatGPT is a pre-trained AI model designed to engage in natural language conversations, utilizing sophisticated techniques from Natural ... ver más
Revista: Future Internet

 
Christopher J. Lynch, Erik J. Jensen, Virginia Zamponi, Kevin O?Brien, Erika Frydenlund and Ross Gore    
Large language models (LLMs) excel in providing natural language responses that sound authoritative, reflect knowledge of the context area, and can present from a range of varied perspectives. Agent-based models and simulations consist of simulated agent... ver más
Revista: Future Internet

 
Panagiotis Skondras, Panagiotis Zervas and Giannis Tzimas    
In this article, we investigate the potential of synthetic resumes as a means for the rapid generation of training data and their effectiveness in data augmentation, especially in categories marked by sparse samples. The widespread implementation of mach... ver más
Revista: Future Internet

 
Homeyra Mahmoudi, Silvana Camboim and Maria Antonia Brovelli    
Voice assistants can elevate interaction in geospatial data web platforms. This research introduces a voice assistant in the BStreams platform and focuses on understanding user commands in the geospatial domain. We developed a specialised geospatial disc... ver más