Applied Sciences, Vol. 12, Issue 2 (2022)

Sound Source Separation Mechanisms of Different Deep Networks Explained from the Perspective of Auditory Perception

Han Li, Kean Chen, Lei Wang, Jianben Liu, Baoquan Wan and Bing Zhou

Abstract

Thanks to the development of deep learning, various sound source separation networks have been proposed and have made significant progress. However, the study of their underlying separation mechanisms is still in its infancy. In this study, deep networks are explained from the perspective of auditory perception mechanisms. For separating two arbitrary sound sources from monaural recordings, three different networks with different parameters are trained and achieve excellent performance. The networks' outputs achieve an average scale-invariant signal-to-distortion ratio improvement (SI-SDRi) higher than 10 dB, comparable to human performance in separating natural sources. More importantly, the most intuitive principle, proximity, is explored through simultaneous and sequential organization experiments. Results show that, regardless of network structure and parameters, the proximity principle is learned spontaneously by all networks. If components are proximate in frequency or time, the networks do not easily separate them. Moreover, the frequency resolution at low frequencies is better than at high frequencies. These behavioral characteristics of all three networks are highly consistent with those of the human auditory system, which implies that the learned proximity principle is not accidental but is the optimal strategy selected by both networks and humans when facing the same task. The emergence of auditory-like separation mechanisms makes it possible to develop a universal system that can be adapted to all sources and scenes.
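
For readers unfamiliar with the SI-SDRi figure quoted in the abstract, the following is a minimal sketch of how the metric is commonly defined; it is not taken from the article. The function names `si_sdr` and `si_sdr_improvement` and the toy sinusoidal signals are assumptions made for illustration. The sketch skips the mean removal that many implementations apply first, so it assumes zero-mean signals.

```python
import math


def si_sdr(estimate, reference):
    """Scale-invariant SDR in dB between two equal-length sample sequences."""
    if len(estimate) != len(reference):
        raise ValueError("estimate and reference must have the same length")
    ref_energy = sum(r * r for r in reference)
    if ref_energy == 0:
        raise ValueError("reference signal has zero energy")
    # Project the estimate onto the reference to find the optimal scaling.
    scale = sum(e * r for e, r in zip(estimate, reference)) / ref_energy
    target = [scale * r for r in reference]
    # Everything in the estimate that is not the scaled reference is distortion.
    distortion = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    distortion_energy = sum(d * d for d in distortion)
    if distortion_energy == 0:
        return math.inf
    if target_energy == 0:
        return -math.inf
    return 10 * math.log10(target_energy / distortion_energy)


def si_sdr_improvement(estimate, mixture, reference):
    """SI-SDRi: gain of the separated estimate over the unprocessed mixture."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)


# Toy example (hypothetical signals): two sinusoids with whole numbers of cycles.
n_samples = 100
reference = [math.sin(2 * math.pi * 5 * n / n_samples) for n in range(n_samples)]
interferer = [math.sin(2 * math.pi * 13 * n / n_samples) for n in range(n_samples)]
mixture = [r + i for r, i in zip(reference, interferer)]
# A pretend separator output that leaves one tenth of the interferer amplitude.
estimate = [r + 0.1 * i for r, i in zip(reference, interferer)]

print(f"SI-SDR of mixture:  {si_sdr(mixture, reference):.2f} dB")
print(f"SI-SDR of estimate: {si_sdr(estimate, reference):.2f} dB")
print(f"SI-SDRi:            {si_sdr_improvement(estimate, mixture, reference):.2f} dB")
```

In this toy case the interferer is attenuated tenfold in amplitude, so the improvement comes out at about 20 dB. Rescaling the estimate by any nonzero constant leaves the score unchanged, which is the sense in which the metric is scale-invariant.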

Similar articles

       
 
Youngmin Kim, Donghwan Kim, Sunho Park, Yonghwa Kim, Jisoo Hong, Sunghee Hong, Jinsoo Jeong, Byounghyo Lee and Hyeonchan Oh    
Digital music is one of the most important commodities on the market due to music royalty distribution in Korea. As the music market has been transformed into a digital music market by means such as downloading and streaming, the distribution of music ro...
Journal: Applied Sciences

 
Grigory Dolgikh, Yuri Morgunov, Alexander Burenin, Vladimir Bezotvetnykh, Vladimir Luchin, Aleksandr Golov and Alexander Tagiltsev    
The methodological and technical possibilities of monitoring temperature fields in the Sea of Japan by acoustic thermometry methods are presented. The proposed tomographic method for monitoring the dynamics and structure of water is based on the transmis...

 
Ming Li, Kefeng Liu, Hongchen Li, Yiyuan Sun, Xi Chen and Kefeng Mao    
At present, some shortcomings of the research on coupling modeling of the oceanic front-sound field may need attention: (1) Most of the acoustic propagation simulation is based on ideal front models, but the application of investigated marine data is lac...

 
Xing Zhao, Xiaoyang Jia, Lin Li and Hanyu Wang    
In this paper, we aim to address the challenge of airflow interference during fault detection in high-speed train bogies by introducing a flow field and investigating the characteristics of the sound field distribution of critical components under its in...
Journal: Applied Sciences

 
Yadong Zhang and Yijun Liu    
A new approach to accelerating the evaluation of monopole and dipole source integrals via the fast multipole method (FMM) in the time domain for general three-dimensional (3-D) aeroacoustic problems is presented in this paper. In this approach, the aeroa...
Journal: Acoustics