Algorithms, Vol. 16, No. 2 (2023)
Title

Audiovisual Biometric Network with Deep Feature Fusion for Identification and Text Prompted Verification

Juan Carlos Atenco, Juan Carlos Moreno and Juan Manuel Ramirez

Abstract

In this work, we present a bimodal multitask network for audiovisual biometric recognition. The network fuses features extracted from face and speech data through a weighted sum, jointly optimizing the contribution of each modality to identify a client. The extracted speech features are simultaneously used in a speech recognition task on random digit sequences. Text-prompted verification is performed by fusing the scores obtained from matching bimodal embeddings with the Word Error Rate (WER) computed from the transcriptions. The score fusion outputs a value that is compared with a threshold to accept or reject the claimed identity of a client. Training and evaluation were carried out on our proprietary database, BIOMEX-DB, and on the VidTIMIT audiovisual database. In the best case, our network achieved an accuracy of 100% for identification and an Equal Error Rate (EER) of 0.44% for verification. To the best of our knowledge, this is the first system that combines these mutually related tasks for biometric recognition.
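
The verification pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not give the fusion formulas, so everything below is an assumption made for illustration: the fixed scalar weight `w_face` (the paper optimizes the modality weights during training), cosine similarity as the embedding matcher, the convex combination controlled by `alpha`, and the default `threshold`. All function names are hypothetical. Only the WER definition (word-level edit distance divided by the number of reference words) is standard.

```python
import math


def fuse_features(face, speech, w_face=0.5):
    """Weighted sum of two equal-length feature vectors.

    Assumption: a fixed scalar weight; the paper learns the weighting.
    """
    if len(face) != len(speech):
        raise ValueError("feature vectors must have the same length")
    return [w_face * f + (1.0 - w_face) * s for f, s in zip(face, speech)]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors (assumed matcher)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def word_error_rate(reference, hypothesis):
    """WER = word-level edit distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Rolling-row Levenshtein distance over word tokens.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def verification_score(enrolled, probe, prompt, transcript, alpha=0.5):
    """Fuse the embedding match score with a transcription score.

    Assumption: a convex combination; WER is clipped to 1 and inverted
    so that a perfect transcription contributes a score of 1.
    """
    match = cosine_similarity(enrolled, probe)
    text = 1.0 - min(word_error_rate(prompt, transcript), 1.0)
    return alpha * match + (1.0 - alpha) * text


def accept(score, threshold=0.8):
    """Accept the claimed identity when the fused score reaches the threshold."""
    return score >= threshold
```

In this sketch, a probe whose embedding matches the enrolled template and whose transcript reproduces the prompted digits receives the maximum fused score, while a replayed recording with the wrong digit sequence is penalized through the WER term even if the embeddings match.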
