REVISTA
Applied Sciences

TODAS

Redirigiendo al acceso original de articulo en 23 segundos...

Inicio / Applied Sciences / Vol: 10 Par: 11 (2020) / Artículo

ARTÍCULO

TITULO

Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm

Ali Dehghan Firoozabadi

Pablo Irarrazaval

Pablo Adasme

David Zabala-Blanco

Hugo Durney

Miguel Sanhueza

Pablo Palacios-Játiva and Cesar Azurdia-Meza

Resumen

Speech enhancement is one of the most important fields in audio and speech signal processing. The speech enhancement methods are divided into the single and multi-channel algorithms. The multi-channel methods increase the speech enhancement performance by providing more information with the use of more microphones. In addition, spatial aliasing is one of the destructive factors in speech enhancement strategies. In this article, we first propose a uniform circular nested microphone array (CNMA) for data recording. The microphone array increases the accuracy of the speech processing methods by increasing the information. Moreover, the proposed nested structure eliminates the spatial aliasing between microphone signals. The circular shape in the proposed nested microphone array implements the speech enhancement algorithm with the same probability for the speakers in all directions. In addition, the speech signal information is different in frequency bands, where the sub-band processing is proposed by the use of the analysis filter bank. The frequency resolution is increased in low frequency components by implementing the proposed filter bank. Then, the affine projection algorithm (APA) is implemented as an adaptive filter on sub-bands that were obtained by the proposed nested microphone array and analysis filter bank. This algorithm adaptively enhances the noisy speech signal. Next, the synthesis filters are implemented for reconstructing the enhanced speech signal. The proposed circular nested microphone array in combination with the sub-band affine projection algorithm (CNMA-SBAPA) is compared with the least mean square (LMS), recursive least square (RLS), traditional APA, distributed multichannel Wiener filter (DB-MWF), and multichannel nonnegative matrix factorization-minimum variance distortionless response (MNMF-MVDR) in terms of the segmental signal-to-noise ratio (SegSNR), perceptual evaluation of speech quality (PESQ), mean opinion score (MOS), short-time objective intelligibility (STOI), and speed of convergence on real and simulated data for white and colored noises. In all scenarios, the proposed method has high accuracy at different levels and noise types by the lower distortion in comparison with other works and, furthermore, the speed of convergence is higher than the compared researches.

Palabras claves

speech enhancement - adaptive filter - microphone array - sub-band processing - filter bank

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 10 Parte: 11 (2020)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Aerospace
Applied Sciences
Algorithms

DOI

https://doi.org/10.3390/app10113955

Artículos similares

Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features

Acceso

Tobias Zeulner, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez and Peter A. Gloor

Current methods for assessing individual well-being in team collaboration at the workplace often rely on manually collected surveys. This limits continuous real-world data collection and proactive measures to improve team member workplace satisfaction. W... ver más

Revista: Information

Dementia Detection from Speech: What If Language Models Are Not the Answer?

Acceso

Mondher Bouazizi, Chuheng Zheng, Siyuan Yang and Tomoaki Ohtsuki

A growing focus among scientists has been on researching the techniques of automatic detection of dementia that can be applied to the speech samples of individuals with dementia. Leveraging the rapid advancements in Deep Learning (DL) and Natural Languag... ver más

Revista: Information

The Use of Correlation Features in the Problem of Speech Recognition

Acceso

Nikita Andriyanov

The problem solved in the article is connected with the increase in the efficiency of phraseological radio exchange message recognition, which sometimes takes place in conditions of increased tension for the pilot. For high-quality recognition, signal pr... ver más

Revista: Algorithms

Speech Enhancement Based on Two-Stage Processing with Deep Neural Network for Laser Doppler Vibrometer

Acceso

Chengkai Cai, Kenta Iwai and Takanobu Nishiura

The development of distant-talk measurement systems has been attracting attention since they can be applied to many situations such as security and disaster relief. One such system that uses a device called a laser Doppler vibrometer (LDV) to acquire sou... ver más

Revista: Applied Sciences

Cognitive Load Assessment of Air Traffic Controller Based on SCNN-TransE Network Using Speech Data

Acceso

Jing Yang, Hongyu Yang, Zhengyuan Wu and Xiping Wu

Due to increased air traffic flow, air traffic controllers (ATCs) operate in a state of high load or even overload for long periods of time, which can seriously affect the reliability and efficiency of controllers? commands. Thus, the early identificatio... ver más

Revista: Aerospace

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas