Redirigiendo al acceso original de articulo en 23 segundos...
Inicio  /  Acoustics  /  Vol: 4 Par: 3 (2022)  /  Artículo
ARTÍCULO
TITULO

Double-Talk Detection-Aided Residual Echo Suppression via Spectrogram Masking and Refinement

Eran Shachar    
Israel Cohen and Baruch Berdugo    

Resumen

Acoustic echo in full-duplex telecommunication systems is a common problem that may cause desired-speech quality degradation during double-talk periods. This problem is especially challenging in low signal-to-echo ratio (SER) scenarios, such as hands-free conversations over mobile phones when the loudspeaker volume is high. This paper proposes a two-stage deep-learning approach to residual echo suppression focused on the low SER scenario. The first stage consists of a speech spectrogram masking model integrated with a double-talk detector (DTD). The second stage consists of a spectrogram refinement model optimized for speech quality by minimizing a perceptual evaluation of speech quality (PESQ) related loss function. The proposed integration of DTD with the masking model outperforms several other configurations based on previous studies. We conduct an ablation study that shows the contribution of each part of the proposed system. We evaluate the proposed system in several SERs and demonstrate its efficiency in the challenging setting of a very low SER. Finally, the proposed approach outperforms competing methods in several residual echo suppression metrics. We conclude that the proposed system is well-suited for the task of low SER residual echo suppression.

 Artículos similares

       
 
Pietro Dell?Oglio, Alessandro Bondielli and Francesco Marcelloni    
Today, most newspapers utilize social media to disseminate news. On the one hand, this results in an overload of related articles for social media users. On the other hand, since social media tends to form echo chambers around their users, different opin... ver más
Revista: Algorithms

 
Hui Sheng, Min Liu, Jiyong Hu, Ping Li, Yali Peng and Yugen Yi    
Time-series data is an appealing study topic in data mining and has a broad range of applications. Many approaches have been employed to handle time series classification (TSC) challenges with promising results, among which deep neural network methods ha... ver más
Revista: Information

 
Hongbing Li and Qunfei Zhang    
Transmitting orthogonal waveforms are the basis for giving full play to the advantages of MIMO radar imaging technology, but the commonly used waveforms with the same frequency cannot meet the orthogonality requirement, resulting in serious coupling nois... ver más
Revista: Algorithms

 
Weiliang Tao, Yan Liu, Zhimin Ma and Wenbin Hu    
This paper proposes a novel particle image velocimetry (PIV) technique to generate an instantaneous two-dimensional velocity field for sediment-laden fluid based on the optical flow algorithm of ultrasound imaging. In this paper, an ultrasonic PIV (UIV) ... ver más
Revista: Applied Sciences

 
Yufan Yang, Chunlei Wei, Fan Yang, Tianyi Lu, Langfeng Zhu and Jun Wei    
An algorithm based on a long short-term memory (LSTM) network is proposed to reduce errors from high-frequency surface wave radar current measurements. In traditional inversion algorithms, the radar velocities are derived from electromagnetic echo signal... ver más
Revista: Applied Sciences