REVISTA
Information

TODAS

Inicio / Information / Vol: 13 Par: 10 (2022) / Artículo

ARTÍCULO

TITULO

Empirical Comparison between Deep and Classical Classifiers for Speaker Verification in Emotional Talking Environments

Ali Bou Nassif

Ismail Shahin

Mohammed Lataifeh

Ashraf Elnagar and Nawel Nemmour

Resumen

Speech signals carry various bits of information relevant to the speaker such as age, gender, accent, language, health, and emotions. Emotions are conveyed through modulations of facial and vocal expressions. This paper conducts an empirical comparison of performances between the classical classifiers: Gaussian Mixture Model (GMM), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Artificial neural networks (ANN); and the deep learning classifiers, i.e., Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN), and Gated Recurrent Unit (GRU) in addition to the ivector approach for a text-independent speaker verification task in neutral and emotional talking environments. The deep models undergo hyperparameter tuning using the Grid Search optimization algorithm. The models are trained and tested using a private Arabic Emirati Speech Database, Ryerson Audio?Visual Database of Emotional Speech and Song dataset (RAVDESS) database, and a public Crowd-Sourced Emotional Multimodal Actors (CREMA) database. Experimental results illustrate that deep architectures do not necessarily outperform classical classifiers. In fact, evaluation was carried out through Equal Error Rate (EER) along with Area Under the Curve (AUC) scores. The findings reveal that the GMM model yields the lowest EER values and the best AUC scores across all datasets, amongst classical classifiers. In addition, the ivector model surpasses all the fine-tuned deep models (CNN, LSTM, and GRU) based on both evaluation metrics in the neutral, as well as the emotional speech. In addition, the GMM outperforms the ivector using the Emirati and RAVDESS databases.

Palabras claves

classical classifiers - deep neural network - emotional speech - feature extraction - speaker verification

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 13 Parte: 10 (2022)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Water
Inteligencia Artificial
Journal of Marine Science and Engineering

DOI

https://doi.org/10.3390/info13100456

Artículos similares

Large Eddy Simulations of Flow Past Circular Cylinders to Determine Head Loss Coefficients of Circular Bar Trash Racks with Perpendicular Inflow Conditions

Acceso

Hannes Zöschg

Trash racks installed at hydropower plants cause head losses that reduce energy output. Previous research has thoroughly investigated head losses through both experimental and field studies. However, only a limited number of numerical studies have been p... ver más

Revista: Water

An Empirical Comparison of Interpretable Models to Post-Hoc Explanations

Acceso

Parisa Mahya and Johannes Fürnkranz

Recently, some effort went into explaining intransparent and black-box models, such as deep neural networks or random forests. So-called model-agnostic methods typically approximate the prediction of the intransparent black-box model with an interpretabl... ver más

Revista: AI

A New Method of Ship Type Identification Based on Underwater Radiated Noise Signals

Acceso

Shanshan Chen, Sheng Guan, Hui Wang, Ningqi Ye and Zexun Wei

Ship type identification is an important basis for ship management and monitoring. The paper proposed a new method of ship type identification by combining characteristic parameters from the energy difference between high and low frequencies and the sens... ver más

Revista: Journal of Marine Science and Engineering

Comparison of Semi-Empirical Impedance Models for Locally-Reacting Acoustic Liners in a Wide Range of Sound Pressure Levels

Acceso

Vadim Palchikovskiy, Aleksandr Kuznetsov, Igor Khramtsov and Oleg Kustov

A comparison is considered of the experimentally obtained impedance of locally reacting acoustic liner samples with the impedance calculated using semi-empirical Goodrich, Sobolev and Eversman models. The semi-empirical impedance models are outlined. In ... ver más

Revista: Acoustics

Map Matching Based on Seq2Seq with Topology Information

Acceso

Yulong Bai, Guolian Li, Tianxiu Lu, Yadong Wu, Weihan Zhang and Yidan Feng

Most existing road network matching algorithms are designed based on previous rules and do not fully utilize the potential of big data and historical tracks. To solve this problem, we introduce a new road network matching algorithm based on deep learning... ver más

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas