ARTÍCULO
TITULO

Joint Representation and Recognition for Ship-Radiated Noise Based on Multimodal Deep Learning

Fei Yuan    
Xiaoquan Ke and En Cheng    

Resumen

Ship recognition based on ship-radiated noise is one of the most important and challenging subjects in underwater acoustic signal processing. The recognition methods for ship-radiated noise recognition include traditional methods and deep learning (DL) methods. Developing from the DL methods and inspired by audio?video speech recognition (AVSR), the paper further introduces multimodal deep learning (multimodal-DL) methods for the recognition of ship-radiated noise. In this paper, ship-radiated noise (acoustics modality) and visual observation of the ships (visual modality) are two different modalities that the multimodal-DL methods model on. The paper specially designs a multimodal-DL framework, the multimodal convolutional neural networks (multimodal-CNNs) for the recognition of ship-radiated noise. Then the paper proposes a strategy based on canonical correlation analysis (CCA-based strategy) to build a joint representation and recognition on the two different single-modality (acoustics modality and visual modality). The multimodal-CNNs and the CCA-based strategy are tested on real ship-radiated noise data recorded. Experimental results show that, using the CCA-based strategy, strong-discriminative information can be built from weak-discriminative information provided from a single-modality. Experimental results also show that as long as any one of the single-modalities can provide information for the recognition, the multimodal-DL methods can have a much better multiclass recognition performance than the DL methods. The paper also discusses the advantages and superiorities of the multimodal-Dl methods over the traditional methods for ship-radiated noise recognition.

 Artículos similares

       
 
Clarissa Pires Vieira Serta, Sverre Haver and Lin Li    
The estimation of long-term extreme response is a crucial task in the design of marine structures. The target extreme responses are typically defined by annual exceedance probabilities of 10-2 and 10-4. Various approaches can be employed for this purpose... ver más

 
Ivan Benemerito, Erica Montefiori, Alberto Marzo and Claudia Mazzà    
Musculoskeletal models (MSKMs) are used to estimate the muscle and joint forces involved in human locomotion, often associated with the onset of degenerative musculoskeletal pathologies (e.g., osteoarthritis). Subject-specific MSKMs offer more accurate p... ver más
Revista: Applied Sciences

 
Huiyan Wu and Jun Huang    
The main purpose of the joint entity and relation extraction is to extract entities from unstructured texts and extract the relation between labeled entities at the same time. At present, most existing joint entity and relation extraction networks ignore... ver más
Revista: Applied Sciences

 
Sukjun Park and Nakhoon Baek    
Recently, ray tracing techniques have been highly adopted to produce high quality images and animations. In this paper, we present our design and implementation of a real-time ray-traced rendering engine. We achieved real-time capability for triangle pri... ver más
Revista: Applied Sciences

 
Zhi-Yong Wang and Dae-Ki Kang    
CORrelation ALignment (CORAL) is an unsupervised domain adaptation method that uses a linear transformation to align the covariances of source and target domains. Deep CORAL extends CORAL with a nonlinear transformation using a deep neural network and ad... ver más
Revista: Applied Sciences