REVISTA
AI

TODAS

Redirigiendo al acceso original de articulo en 20 segundos...

Inicio / AI / Vol: 4 Par: 4 (2023) / Artículo

ARTÍCULO

TITULO

Chatbots Put to the Test in Math and Logic Problems: A Comparison and Assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard

Vagelis Plevris

George Papazafeiropoulos and Alejandro Jiménez Rios

Resumen

In an age where artificial intelligence is reshaping the landscape of education and problem solving, our study unveils the secrets behind three digital wizards, ChatGPT-3.5, ChatGPT-4, and Google Bard, as they engage in a thrilling showdown of mathematical and logical prowess. We assess the ability of the chatbots to understand the given problem, employ appropriate algorithms or methods to solve it, and generate coherent responses with correct answers. We conducted our study using a set of 30 questions. These questions were carefully crafted to be clear, unambiguous, and fully described using plain text only. Each question has a unique and well-defined correct answer. The questions were divided into two sets of 15: Set A consists of ?Original? problems that cannot be found online, while Set B includes ?Published? problems that are readily available online, often with their solutions. Each question was presented to each chatbot three times in May 2023. We recorded and analyzed their responses, highlighting their strengths and weaknesses. Our findings indicate that chatbots can provide accurate solutions for straightforward arithmetic, algebraic expressions, and basic logic puzzles, although they may not be consistently accurate in every attempt. However, for more complex mathematical problems or advanced logic tasks, the chatbots? answers, although they appear convincing, may not be reliable. Furthermore, consistency is a concern as chatbots often provide conflicting answers when presented with the same question multiple times. To evaluate and compare the performance of the three chatbots, we conducted a quantitative analysis by scoring their final answers based on correctness. Our results show that ChatGPT-4 performs better than ChatGPT-3.5 in both sets of questions. Bard ranks third in the original questions of Set A, trailing behind the other two chatbots. However, Bard achieves the best performance, taking first place in the published questions of Set B. This is likely due to Bard?s direct access to the internet, unlike the ChatGPT chatbots, which, due to their designs, do not have external communication capabilities.

Palabras claves

chatbot - AI - logic - mathematics - ChatGPT - GPT-3.5 - GPT-4 - Google Bard

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 4 Parte: 4 (2023)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

International Journal of Open Information Technologies
Information
Water

DOI

https://doi.org/10.3390/ai4040048

Artículos similares

AI Chatbots in Digital Mental Health

Acceso

Luke Balcombe

Artificial intelligence (AI) chatbots have gained prominence since 2022. Powered by big data, natural language processing (NLP) and machine learning (ML) algorithms, they offer the potential to expand capabilities, improve productivity and provide guidan... ver más

Revista: Informatics

Automatic Recommendation of Forum Threads and Reinforcement Activities in a Data Structure and Programming Course

Acceso

Laura Plaza, Lourdes Araujo, Fernando López-Ostenero and Juan Martínez-Romo

Online learning is quickly becoming a popular choice instead of traditional education. One of its key advantages lies in the flexibility it offers, allowing individuals to tailor their learning experiences to their unique schedules and commitments. Moreo... ver más

Revista: Applied System Innovation

The Socio-Environmental and Human Health Problems Related to the Use of Pesticides and the Use of Advanced Oxidative Processes for Their Degradation: Brazil

Acceso

Anna Karla Santos Pereira, Lívia Fernandes Silva, Gustavo Antonio Figueredo Barbosa, Thaynara Guimarães Miranda, Rayane Reis Sousa, Renato Almeida Sarmento, Nelson Luís Gonçalves Dias Souza, Douglas Henrique Pereira and Grasiele Soares Cavallini

The present study reviews the quantitative data on the use of pesticides and their relationship to environmental and human health problems in Brazil. The detection of residual concentrations of pesticides in food and water consumed by humans has raised q... ver más

Revista: Water

Visitors? Environmental Concerns in Gray?s Reef National Marine Sanctuary: An Offshore Marine Protected Area

Acceso

Marieke Lemmen, Robert C. Burns, Ross G. Andrew and Jasmine Cardozo Moreira

Marine sanctuaries serve as popular destinations for ecotourism, natural resource exploration, and recreation across the US. While often positive, visitation in marine and coastal areas can cause ecological threats to these ecosystems. Increased visitati... ver más

Revista: Water

Strengthening the Security of Smart Contracts through the Power of Artificial Intelligence

Acceso

Moez Krichen

Smart contracts (SCs) are digital agreements that execute themselves and are stored on a blockchain. Despite the fact that they offer numerous advantages, such as automation and transparency, they are susceptible to a variety of assaults due to their com... ver más

Revista: Computers

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas