Achintya Kumar Sarkar and Zheng-Hua Tan
Deep representation learning has gained significant momentum in advancing text-dependent speaker verification (TD-SV) systems. When designing deep neural networks (DNN) for extracting bottleneck (BN) features, the key considerations include training targ...
Jiachen Zhang, Guoqing Tu, Shubo Liu and Zhaohui Cai
The rapid development of speech synthesis technology has significantly improved the naturalness and human-likeness of synthetic speech. As the technical barriers for speech synthesis are rapidly lowering, the number of illegal activities such as fraud an...
Juan Carlos Atenco, Juan Carlos Moreno and Juan Manuel Ramirez
In this work we present a bimodal multitask network for audiovisual biometric recognition. The proposed network performs the fusion of features extracted from face and speech data through a weighted sum to jointly optimize the contribution of each modali...
Wondimu Lambamo, Ramasamy Srinivasagan and Worku Jifara
Speaker recognition systems perform very well on datasets without noise and mismatch. However, performance degrades with environmental noise, channel variation, and physical and behavioral changes in the speaker. The types of Spea...
Fei Xie, Dalong Zhang and Chengming Liu
Transformer models are now widely used for speech processing tasks due to their powerful sequence modeling capabilities. Previous work determined an efficient way to model speaker embeddings using the Transformer model by combining transformers with conv...
Zesheng Chen, Li-Chi Chang, Chao Chen, Guoping Wang and Zhuming Bi
Speaker verification systems use human voices as an important biometric to identify legitimate users, thus adding a security layer to voice-controlled Internet-of-things smart homes against illegal access. Recent studies have demonstrated that speaker ve...
Ali Bou Nassif, Ismail Shahin, Mohammed Lataifeh, Ashraf Elnagar and Nawel Nemmour
Speech signals carry various bits of information relevant to the speaker such as age, gender, accent, language, health, and emotions. Emotions are conveyed through modulations of facial and vocal expressions. This paper conducts an empirical comparison o...
Sung-Hyun Yoon, Jong-June Jeon and Ha-Jin Yu
Francesc Alías, Antonio Bonafonte and António Teixeira
The main goal of this Special Issue is to present the latest advances in research and novel applications of speech and language technologies based on the works presented at the IberSPEECH edition held in Barcelona in 2018, paying special attention to tho...
Oleksandr Yudin, Ruslana Ziubina, Serhii Buchyk, Olena Matviichuk-Yudina, Olha Suprun, Viktoriia Ivannikova
Pages 56-64
Methods are proposed for verifying and identifying the operator from the biometric features of a speech signal in the control systems of unmanned aerial systems. A method has been developed for the effective width of the spectrum of...