JOURNAL
Applied Sciences

ARTICLE

TITLE

Improving Hybrid CTC/Attention Architecture with Time-Restricted Self-Attention CTC for End-to-End Speech Recognition

Long Wu

Ta Li

Li Wang and Yonghong Yan

Abstract

As demonstrated by the hybrid connectionist temporal classification (CTC)/Attention architecture, joint training with a CTC objective is very effective at solving the misalignment problem in the attention-based end-to-end automatic speech recognition (ASR) framework. However, the CTC output relies only on the current input, which leads to the hard alignment issue. To address this problem, this paper proposes the time-restricted attention CTC/Attention architecture, which integrates an attention mechanism into the CTC branch. "Time-restricted" means that the attention mechanism operates on a limited window of frames to the left and right of the current frame. In this study, we first explore time-restricted location-aware attention CTC/Attention, establishing the proper time-restricted attention window size. Inspired by the success of self-attention in machine translation, we further introduce time-restricted self-attention CTC/Attention, which can better model the long-range dependencies among the frames. Experiments on the Wall Street Journal (WSJ), Augmented Multi-party Interaction (AMI), and Switchboard (SWBD) tasks demonstrate the effectiveness of the proposed time-restricted self-attention CTC/Attention. Finally, to explore the robustness of this method to noise and reverberation, we jointly train a neural beamformer frontend with the time-restricted attention CTC/Attention ASR backend on the CHiME-4 dataset. The reduction in word error rate (WER) and the increase in perceptual evaluation of speech quality (PESQ) confirm the effectiveness of this framework.
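The idea of attending only to a limited window of frames can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `time_restricted_self_attention`, the parameters `left` and `right`, and the use of the raw frames as queries, keys, and values (with no learned projections) are assumptions made purely for illustration.

```python
import math


def time_restricted_self_attention(frames, left, right):
    """Scaled dot-product self-attention restricted to a local window.

    Output frame t is a softmax-weighted average of
    frames[t - left : t + right + 1], clipped at the sequence edges.
    Assumption for illustration: queries, keys and values are the raw
    frames themselves (a real model would apply learned projections).
    """
    num_frames = len(frames)
    dim = len(frames[0])
    outputs = []
    for t in range(num_frames):
        lo = max(0, t - left)
        hi = min(num_frames, t + right + 1)
        window = frames[lo:hi]
        # Similarity between the current frame and each frame in its window.
        scores = [
            sum(q * k for q, k in zip(frames[t], key)) / math.sqrt(dim)
            for key in window
        ]
        # Numerically stable softmax over the window only.
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted average of the windowed frames.
        context = [
            sum(w * v[d] for w, v in zip(weights, window)) for d in range(dim)
        ]
        outputs.append(context)
    return outputs
```

With `left=0` and `right=0` each frame attends only to itself, which corresponds to the plain CTC case in which the output relies only on the current input. Larger windows let each output draw on neighbouring frames.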

Keywords

automatic speech recognition - end-to-end - CTC - self-attention - hybrid CTC/attention

ISSUE

Volume: 9 Issue: 21 (2019)

SUBJECTS

CIVIL ENGINEERING AND CONSTRUCTION
TECHNOLOGY

SIMILAR JOURNALS

Water
Algorithms
Applied Sciences

DOI

https://doi.org/10.3390/app9214639

Similar articles

A Particle Swarm and Smell Agent-Based Hybrid Algorithm for Enhanced Optimization

Abdullahi T. Sulaiman, Habeeb Bello-Salau, Adeiza J. Onumanyi, Muhammed B. Mu'azu, Emmanuel A. Adedokun, Ahmed T. Salawudeen and Abdulfatai D. Adekale

The particle swarm optimization (PSO) algorithm is widely used for optimization purposes across various domains, such as in precision agriculture, vehicular ad hoc networks, path planning, and for the assessment of mathematical test functions towards ben...

Journal: Algorithms

An Ensemble CNOP Method Based on a Pre-Screening Mechanism for Targeted Observations in the South China Sea

Ru Wang, Qingyu Zheng, Wei Li, Guijun Han, Xuan Wang and Song Hu

The uncertainty in the initial condition seriously affects the forecasting skill of numerical models. Targeted observations play an important role in reducing uncertainty in numerical prediction. The conditional nonlinear optimal perturbation (CNOP) meth...

Journal: Journal of Marine Science and Engineering

Chinese Cyberbullying Detection Using XLNet and Deep Bi-LSTM Hybrid Model

Shifeng Chen, Jialin Wang and Ketai He

The popularization of the internet and the widespread use of smartphones have led to a rapid growth in the number of social media users. While information technology has brought convenience to people, it has also given rise to cyberbullying, which has a ...

Journal: Information

A Hybrid Forecasting Model for Self-Similar Traffic in LEO Mega-Constellation Networks

Chi Han, Wei Xiong and Ronghuan Yu

Mega-constellation network traffic forecasting provides key information for routing and resource allocation, which is of great significance to the performance of satellite networks. However, due to the self-similarity and long-range dependence (LRD) of m...

Journal: Aerospace

A New Approach to Identifying Sorghum Hybrids Using UAV Imagery Using Multispectral Signature and Machine Learning

Dthenifer Cordeiro Santana, Gustavo de Faria Theodoro, Ricardo Gava, João Lucas Gouveia de Oliveira, Larissa Pereira Ribeiro Teodoro, Izabela Cristina de Oliveira, Fábio Henrique Rojo Baio, Carlos Antonio da Silva Junior, Job Teixeira de Oliveira and Paulo Eduardo Teodoro

Using multispectral sensors attached to unmanned aerial vehicles (UAVs) can assist in the collection of morphological and physiological information from several crops. This approach, also known as high-throughput phenotyping, combined with data processin...

Journal: Algorithms
