183 Artículos

A Modular Framework for Domain-Specific Conversational Systems Powered by Never-Ending Learning

Acceso

en línea

Felipe Coelho de Abreu Pinna, Victor Takashi Hayashi, João Carlos Néto, Rosangela de Fátima Pereira Marquesone, Maísa Cristina Duarte, Rodrigo Suzuki Okada and Wilson Vicente Ruggiero

Complex and long interactions (e.g., a change of topic during a conversation) justify the use of dialog systems to develop task-oriented chatbots and intelligent virtual assistants. The development of dialog systems requires considerable effort and takes... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 4 Año: 2024

Customization of the ASR System for ATC Speech with Improved Fusion

Acceso

en línea

Jiahao Fan and Weijun Pan

In recent years, automatic speech recognition (ASR) technology has improved significantly. However, the training process for an ASR model is complex, involving large amounts of data and a large number of algorithms. The task of training a new model for a... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 3 Año: 2024

Tibetan Sentence Boundaries Automatic Disambiguation Based on Bidirectional Encoder Representations from Transformers on Byte Pair Encoding Word Cutting Method

Acceso

en línea

Fenfang Li, Zhengzhang Zhao, Li Wang and Han Deng

Sentence Boundary Disambiguation (SBD) is crucial for building datasets for tasks such as machine translation, syntactic analysis, and semantic analysis. Currently, most automatic sentence segmentation in Tibetan adopts the methods of rule-based and stat... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 7 Año: 2024

Minoan Cryptanalysis: Computational Approaches to Deciphering Linear A and Assessing Its Connections with Language Families from the Mediterranean and the Black Sea Areas

Acceso

en línea

Aaradh Nepal and Francesco Perono Cacciafoco

During the Bronze Age, the inhabitants of regions of Crete, mainland Greece, and Cyprus inscribed their languages using, among other scripts, a writing system called Linear A. These symbols, mainly characterized by combinations of lines, have, since thei... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 2 Año: 2024

Dementia Detection from Speech: What If Language Models Are Not the Answer?

Acceso

en línea

Mondher Bouazizi, Chuheng Zheng, Siyuan Yang and Tomoaki Ohtsuki

A growing focus among scientists has been on researching the techniques of automatic detection of dementia that can be applied to the speech samples of individuals with dementia. Leveraging the rapid advancements in Deep Learning (DL) and Natural Languag... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 1 Año: 2024

Beyond Lexical Boundaries: LLM-Generated Text Detection for Romanian Digital Libraries

Acceso

en línea

Melania Nitu and Mihai Dascalu

Machine-generated content reshapes the landscape of digital information; hence, ensuring the authenticity of texts within digital libraries has become a paramount concern. This work introduces a corpus of approximately 60 k Romanian documents, including ... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 2 Año: 2024

Research on Entity and Relationship Extraction with Small Training Samples for Cotton Pests and Diseases

Acceso

en línea

Weiwei Yuan, Wanxia Yang, Liang He, Tingwei Zhang, Yan Hao, Jing Lu and Wenbo Yan

The extraction of entities and relationships is a crucial task in the field of natural language processing (NLP). However, existing models for this task often rely heavily on a substantial amount of labeled data, which not only consumes time and labor bu... ver más

Revista: Agriculture Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 3 Año: 2024

Automatic Translation between Mixtec to Spanish Languages Using Neural Networks

Acceso

en línea

Hermilo Santiago-Benito , Diana-Margarita Córdova-Esparza , Noé-Alejandro Castro-Sánchez , Teresa García-Ramirez , Julio-Alejandro Romero-González and Juan Terven

This paper introduces a novel method for collecting and translating texts from the Mixtec to the Spanish language. The method comprises four primary steps. First, we collected a Mixtec?Spanish corpus that includes 4568 sentences from educational and reli... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 7 Año: 2024

Four Million Segments and Counting: Building an English-Croatian Parallel Corpus through Crowdsourcing Using a Novel Gamification-Based Platform

Acceso

en línea

Rafal Jaworski, Sanja Seljan and Ivan Dunder

Parallel corpora have been widely used in the fields of natural language processing and translation as they provide crucial multilingual information. They are used to train machine translation systems, compile dictionaries, or generate inter-language wor... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 4 Año: 2023

Computers? Interpretations of Knowledge Representation Using Pre-Conceptual Schemas: An Approach Based on the BERT and Llama 2-Chat Models

Acceso

en línea

Jesus Insuasti, Felipe Roa and Carlos Mario Zapata-Jaramillo

Pre-conceptual schemas are a straightforward way to represent knowledge using controlled language regardless of context. Despite the benefits of using pre-conceptual schemas by humans, they present challenges when interpreted by computers. We propose an ... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 7 Num: 0 Par: 4 Año: 2023

« Anterior Página: 1 de 11 Siguiente »