|
|
|
Wondimu Lambamo, Ramasamy Srinivasagan and Worku Jifara
The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker. The types of Spea...
ver más
|
|
|
|
|
|
|
Jing Yang, Hongyu Yang, Zhengyuan Wu and Xiping Wu
Due to increased air traffic flow, air traffic controllers (ATCs) operate in a state of high load or even overload for long periods of time, which can seriously affect the reliability and efficiency of controllers? commands. Thus, the early identificatio...
ver más
|
|
|
|
|
|
|
Dana Utebayeva, Lyazzat Ilipbayeva and Eric T. Matson
The detection and classification of engine-based moving objects in restricted scenes from acoustic signals allow better Unmanned Aerial System (UAS)-specific intelligent systems and audio-based surveillance systems. Recurrent Neural Networks (RNNs) provi...
ver más
|
|
|
|
|
|
|
Shuang Yang, Lingzhi Xue, Xi Hong and Xiangyang Zeng
Recently, deep learning has been widely used in ship-radiated noise classification. To improve classification efficiency, avoiding high computational costs is an important research direction in ship-radiated noise classification. We propose a lightweight...
ver más
|
|
|
|
|
|
|
Mei Wang, Qingshan Mei, Xiyu Song, Xin Liu, Ruixiang Kan, Fangzhi Yao, Junhan Xiong and Hongbing Qiu
Unsupervised anomalous sound detection by machines holds significant importance within the realm of industrial automation. Currently, the task of machine-based anomalous sound detection in complex industrial settings is faced with issues such as the chal...
ver más
|
|
|
|
|
|
|
Omar Adel, Karma M. Fathalla and Ahmed Abo ElFarag
Emotion recognition is crucial in artificial intelligence, particularly in the domain of human?computer interaction. The ability to accurately discern and interpret emotions plays a critical role in helping machines to effectively decipher users? underly...
ver más
|
|
|
|
|
|
|
Diego de Benito-Gorrón, Daniel Ramos and Doroteo T. Toledano
The Sound Event Detection task aims to determine the temporal locations of acoustic events in audio clips. In recent years, the relevance of this field is rising due to the introduction of datasets such as Google AudioSet or DESED (Domestic Environment S...
ver más
|
|
|
|
|
|
|
Yifan Liu and Jin Zheng
Text-to-speech synthesis is a computational technique for producing synthetic, human-like speech by a computer. In recent years, speech synthesis techniques have developed, and have been employed in many applications, such as automatic translation applic...
ver más
|
|
|
|