143 Artículos

Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features

Acceso

en línea

Tobias Zeulner, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez and Peter A. Gloor

Current methods for assessing individual well-being in team collaboration at the workplace often rely on manually collected surveys. This limits continuous real-world data collection and proactive measures to improve team member workplace satisfaction. W... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 4 Año: 2024

Whisper40: A Multi-Person Chinese Whisper Speaker Recognition Dataset Containing Same-Text Neutral Speech

Acceso

en línea

Jingwen Yang and Ruohua Zhou

Whisper speaker recognition (WSR) has received extensive attention from researchers in recent years, and it plays an important role in medical, judicial, and other fields. Among them, the establishment of a whisper dataset is very important for the study... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 4 Año: 2024

A Dual-Branch Speech Enhancement Model with Harmonic Repair

Acceso

en línea

Lizhen Jia, Yanyan Xu and Dengfeng Ke

Recent speech enhancement studies have mostly focused on completely separating noise from human voices. Due to the lack of specific structures for harmonic fitting in previous studies and the limitations of the traditional convolutional receptive field, ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 4 Año: 2024

Sign-to-Text Translation from Panamanian Sign Language to Spanish in Continuous Capture Mode with Deep Neural Networks

Acceso

en línea

Alvaro A. Teran-Quezada, Victor Lopez-Cabrera, Jose Carlos Rangel and Javier E. Sanchez-Galan

Convolutional neural networks (CNN) have provided great advances for the task of sign language recognition (SLR). However, recurrent neural networks (RNN) in the form of long?short-term memory (LSTM) have become a means for providing solutions to problem... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 8 Num: 0 Par: 3 Año: 2024

Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recognition

Acceso

en línea

Wondimu Lambamo, Ramasamy Srinivasagan and Worku Jifara

The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker. The types of Spea... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 1 Año: 2023

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain

Acceso

en línea

Driss Khalil, Amrutha Prasad, Petr Motlicek, Juan Zuluaga-Gomez, Iuliia Nigmatulina, Srikanth Madikeri and Christof Schuepbach

In air traffic management (ATM), voice communications are critical for ensuring the safe and efficient operation of aircraft. The pertinent voice communications?air traffic controller (ATCo) and pilot?are usually transmitted in a single channel, which po... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 10 Año: 2023

SAPBERT: Speaker-Aware Pretrained BERT for Emotion Recognition in Conversation

Acceso

en línea

Seunguook Lim and Jihie Kim

Emotion recognition in conversation (ERC) is receiving more and more attention, as interactions between humans and machines increase in a variety of services such as chat-bot and virtual assistants. As emotional expressions within a conversation can heav... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 1 Año: 2023

On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification

Acceso

en línea

Achintya Kumar Sarkar and Zheng-Hua Tan

Deep representation learning has gained significant momentum in advancing text-dependent speaker verification (TD-SV) systems. When designing deep neural networks (DNN) for extracting bottleneck (BN) features, the key considerations include training targ... ver más

Revista: Acoustics Formato: Electrónico

Tabla de contenido: Vol: 5 Num: 0 Par: 3 Año: 2023

Audio Anti-Spoofing Based on Audio Feature Fusion

Acceso

en línea

Jiachen Zhang, Guoqing Tu, Shubo Liu and Zhaohui Cai

The rapid development of speech synthesis technology has significantly improved the naturalness and human-likeness of synthetic speech. As the technical barriers for speech synthesis are rapidly lowering, the number of illegal activities such as fraud an... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 7 Año: 2023

Audiovisual Biometric Network with Deep Feature Fusion for Identification and Text Prompted Verification

Acceso

en línea

Juan Carlos Atenco, Juan Carlos Moreno and Juan Manuel Ramirez

In this work we present a bimodal multitask network for audiovisual biometric recognition. The proposed network performs the fusion of features extracted from face and speech data through a weighted sum to jointly optimize the contribution of each modali... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 2 Año: 2023

« Anterior Página: 1 de 9 Siguiente »