6 Artículos

MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension

Acceso

en línea

Pau Baquero-Arnal, Javier Jorge, Adrià Giménez, Javier Iranzo-Sánchez, Alejandro Pérez, Gonçal Vicent Garcés Díaz-Munío, Joan Albert Silvestre-Cerdà, Jorge Civera, Albert Sanchis and Alfons Juan

This work has direct application in live automatic captioning of audiovisual material, which is fundamental in accessibility.

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 2 Año: 2022

A 2D Convolutional Gating Mechanism for Mandarin Streaming Speech Recognition

Acceso

en línea

Xintong Wang and Chuangang Zhao

Recent research shows recurrent neural network-Transducer (RNN-T) architecture has become a mainstream approach for streaming speech recognition. In this work, we investigate the VGG2 network as the input layer to the RNN-T in streaming speech recognitio... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 4 Año: 2021

Web Radio Automation for Audio Stream Management in the Era of Big Data

Acceso

en línea

Nikolaos Vryzas, Nikolaos Tsipas and Charalampos Dimoulas

Radio is evolving in a changing digital media ecosystem. Audio-on-demand has shaped the landscape of big unstructured audio data available online. In this paper, a framework for knowledge extraction is introduced, to improve discoverability and enrichmen... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 4 Año: 2020

« Anterior Página: 1 de 1 Siguiente »