Inicio  /  Applied Sciences  /  Vol: 10 Par: 6 (2020)  /  Artículo
ARTÍCULO
TITULO

An Unsupervised Deep Learning System for Acoustic Scene Analysis

Mou Wang    
Xiao-Lei Zhang and Susanto Rahardja    

Resumen

Acoustic scene analysis has attracted a lot of attention recently. Existing methods are mostly supervised, which requires well-predefined acoustic scene categories and accurate labels. In practice, there exists a large amount of unlabeled audio data, but labeling large-scale data is not only costly but also time-consuming. Unsupervised acoustic scene analysis on the other hand does not require manual labeling but is known to have significantly lower performance and therefore has not been well explored. In this paper, a new unsupervised method based on deep auto-encoder networks and spectral clustering is proposed. It first extracts a bottleneck feature from the original acoustic feature of audio clips by an auto-encoder network, and then employs spectral clustering to further reduce the noise and unrelated information in the bottleneck feature. Finally, it conducts hierarchical clustering on the low-dimensional output of the spectral clustering. To fully utilize the spatial information of stereo audio, we further apply the binaural representation and conduct joint clustering on that. To the best of our knowledge, this is the first time that a binaural representation is being used in unsupervised learning. Experimental results show that the proposed method outperforms the state-of-the-art competing methods.

 Artículos similares

       
 
Abrar Alamr and Abdelmonim Artoli    
Anomaly detection is one of the basic issues in data processing that addresses different problems in healthcare sensory data. Technology has made it easier to collect large and highly variant time series data; however, complex predictive analysis models ... ver más
Revista: Algorithms

 
Dominik Stallmann and Barbara Hammer    
Novel neural network models that can handle complex tasks with fewer examples than before are being developed for a wide range of applications. In some fields, even the creation of a few labels is a laborious task and impractical, especially for data tha... ver más
Revista: Algorithms

 
Alireza Saberironaghi, Jing Ren and Moustafa El-Gindy    
Over the last few decades, detecting surface defects has attracted significant attention as a challenging task. There are specific classes of problems that can be solved using traditional image processing techniques. However, these techniques struggle wi... ver más
Revista: Algorithms

 
Navaneethakrishna Makaram, Sarvagya Gupta, Matthew Pesce, Jeffrey Bolton, Scellig Stone, Daniel Haehn, Marc Pomplun, Christos Papadelis, Phillip Pearl, Alexander Rotenberg, Patricia Ellen Grant and Eleonora Tamilia    
In drug-resistant epilepsy, a visual inspection of intracranial electroencephalography (iEEG) signals is often needed to localize the epileptogenic zone (EZ) and guide neurosurgery. The visual assessment of iEEG time-frequency (TF) images is an alternati... ver más
Revista: Algorithms

 
Paolo Massimo Buscema, Giulia Massini, Giovanbattista Raimondi, Giuseppe Caporaso, Marco Breda and Riccardo Petritoli    
The automatic identification system (AIS) facilitates the monitoring of ship movements and provides essential input parameters for traffic safety. Previous studies have employed AIS data to detect behavioral anomalies and classify vessel types using supe... ver más
Revista: Algorithms