Assessing the effectiveness of ensembles in Speech Emotion Recognition: Performance analysis under challenging scenarios

Juan-Migual López-Gil, Nestor Garay-Vitoria

2024 - Expert Systems with Applications, Paper No.: 122905

Aldizkariko artikulua

Ikerketa lerroa:
Konputazio Emozionala
Autoreak sinadura ordenaren arabera:
Juan-Migual López-Gil, Nestor Garay-Vitoria
Azalpena:

Speech Emotion Recognition (SER) is an important application in areas such as online gaming, e-learning, and medical care. However, recognizing emotion in speech is computationally difficult since it necessitates a thorough search for feature selection, algorithm hyperparameter tuning, or algorithm combinations, making ensemble use interesting. Although ensembles are frequently employed in SER, their application has not been greatly explored, and their potential benefits for enhancing recognition accuracy and robustness to variability in speech signals have not been fully realized. The purpose of this article is to assess the effectiveness of ensembles in SER by analyzing their performance under challenging scenarios. The experiment made in this study involved evaluating speech samples from various languages, using an out-of-date set of features, and using simple algorithms with default hyperparameters. For classifier set selection, a basic ensemble technique with decision-level voting and a rudimentary heuristic were applied.

The results indicated that basic classifiers significantly improved the SER rate, with an absolute improvement ranging from 0.57% to 9.89%. The suggested ensemble approach outperformed state of the art SER methods, including deep learning-based ones, in terms of recognition rates. The findings justify the use of ensembles in SER applications, particularly in circumstances with insufficient data or out-of-date features and algorithms. The work recommends further investigation of ensembles to enhance recognition accuracy and improve robustness in the face of voice signal variability. Finally, the results of the experiment show that ensembles have the potential to increase SER accuracy, and future research in this field can benefit from the study’s conclusions.

Argitaletxea:
Elsevier
Argitalpen urtea:
2024
ISBN - ISSN:
0957-4174
Fitxategiaren URLa:
https://doi.org/10.1016/j.eswa.2023.122905
Kalitate adierazleak:
WoS (2023) IF: 7.5, Rank: Q1 (6/106 in Operations Research & Management Science)
SJR (2023) IF: 1.875, Rank: Q1 (Artificial Intelligence)
Argitalpenaren izena:
Expert Systems with Applications, Paper No.: 122905
Bolumena:
243