Linkai Peng, Yingming Gao, Rian Bao, Ya Li and Jinsong Zhang
As an indispensable module of computer-aided pronunciation training (CAPT) systems, mispronunciation detection and diagnosis (MDD) techniques have attracted considerable attention from academia and industry over the past decade. To train robust MDD models, t...
Jiachen Zhang, Guoqing Tu, Shubo Liu and Zhaohui Cai
The rapid development of speech synthesis technology has significantly improved the naturalness and human-likeness of synthetic speech. As the technical barriers to speech synthesis fall rapidly, the number of illegal activities such as fraud an...
Konlakorn Wongpatikaseree, Sattaya Singkul, Narit Hnoohom and Sumeth Yuenyong
Language resources are the main factor in deep-learning-based speech emotion recognition (SER) models. Thai is a low-resource language with less available data than high-resource languages such as German. This paper describes the framework of using a...