Redirigiendo al acceso original de articulo en 19 segundos...
Inicio  /  Algorithms  /  Vol: 15 Par: 2 (2022)  /  Artículo
ARTÍCULO
TITULO

Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features

Qiang Zhu    
Zhong Wang    
Yunfeng Dou and Jian Zhou    

Resumen

A conversion method based on the inversion of Mel frequency cepstral coefficient (MFCC) features was proposed to convert whispered speech into normal speech. First, the MFCC features of whispered speech and normal speech were extracted and a matching relation between the MFCC feature parameters of whispered speech and normal speech was developed through the Gaussian mixture model (GMM). Then, the MFCC feature parameters of normal speech corresponding to whispered speech were obtained based on the GMM and, finally, whispered speech was converted into normal speech through the inversion of MFCC features. The experimental results showed that the cepstral distortion (CD) of the normal speech converted by the proposed method was 21% less than that of the normal speech converted by the linear predictive coefficient (LPC) features, the mean opinion score (MOS) was 3.56, and a satisfactory outcome in both intelligibility and sound quality was achieved.

 Artículos similares

       
 
Valtteri Hongisto and Jukka Keränen    
ISO 3382-3 is globally used to determine the room acoustic conditions of open-plan offices using in situ measurements. The key outcomes of the standard are three single-number quantities: distraction distance, rD, A-weighted sound pressure level of speec... ver más
Revista: Applied Sciences

 
Bhagya Nathali Silva, Murad Khan, Ruchire Eranga Wijesinghe, Samantha Thelijjagoda and Kijun Han    
Survivors of either a hemorrhagic or ischemic stroke tend to acquire aphasia and experience spontaneous recovery during the first six months. Nevertheless, a considerable number of patients sustain aphasia and require speech and language therapy to overc... ver más
Revista: Applied Sciences