5 Artículos

End-to-End Mispronunciation Detection and Diagnosis Using Transfer Learning

Acceso

en línea

Linkai Peng, Yingming Gao, Rian Bao, Ya Li and Jinsong Zhang

As an indispensable module of computer-aided pronunciation training (CAPT) systems, mispronunciation detection and diagnosis (MDD) techniques have attracted a lot of attention from academia and industry over the past decade. To train robust MDD models, t... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 11 Año: 2023

Audio Anti-Spoofing Based on Audio Feature Fusion

Acceso

en línea

Jiachen Zhang, Guoqing Tu, Shubo Liu and Zhaohui Cai

The rapid development of speech synthesis technology has significantly improved the naturalness and human-likeness of synthetic speech. As the technical barriers for speech synthesis are rapidly lowering, the number of illegal activities such as fraud an... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 7 Año: 2023

Real-Time End-to-End Speech Emotion Recognition with Cross-Domain Adaptation

Acceso

en línea

Konlakorn Wongpatikaseree, Sattaya Singkul, Narit Hnoohom and Sumeth Yuenyong

Language resources are the main factor in speech-emotion-recognition (SER)-based deep learning models. Thai is a low-resource language that has a smaller data size than high-resource languages such as German. This paper describes the framework of using a... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 6 Num: 0 Par: 3 Año: 2022

« Anterior Página: 1 de 1 Siguiente »