Optimized cross-corpus speech emotion recognition framework based on Normalized 1D Convolutional Neural Network
DOI:
https://doi.org/10.6977/IJoSI.202502_9(1).0008
Keywords:
Convolutional Neural Networks, Cross-Corpus, Deep Learning, Feature Extraction, Signal Processing, Speech Emotion Recognition, XGB Classifier
Abstract
Human-computer interaction (HCI) can be improved by detecting emotions from voice. Speech Emotion Recognition (SER) software typically detects the presence of various emotions in the speaker. However, there are significant challenges in combining information from multidisciplinary domains, notably speech emotion recognition and applied psychology. Some researchers have used handcrafted attributes to categorize emotions and obtained high classification accuracy. However, these attributes reduce categorization accuracy in multi-lingual environments. Deep learning algorithms have been used to automatically extract local representations from the supplied speech data, but these strategies cannot extract the most valuable characteristics from challenging speech inputs. To address this limitation, we propose an innovative SER framework that applies data augmentation before generating relevant feature sets from each utterance and selecting the most discriminative features. The chosen feature vector is then fed into the Normalized 1D CNN for emotion recognition using multi-lingual databases. This study also evaluates the effectiveness of an XGB classifier for multi-lingual emotion recognition in a cross-corpus setting, testing it on data from one corpus after training on a different corpus. The test results show that the proposed SER architecture performs better than existing SER approaches.
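The cross-corpus protocol outlined in the abstract (augment the training data, extract features from each utterance, train on one corpus, test on another) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the synthetic sine-tone "corpora", the two toy features (RMS energy and zero-crossing rate), the additive-noise augmentation, and the nearest-centroid classifier are all hypothetical stand-ins for real speech databases, the paper's feature sets, and the Normalized 1D CNN or XGB classifier.

```python
import math
import random


def augment_with_noise(signal, noise_level, rng):
    """Additive Gaussian noise, one common augmentation (assumed here)."""
    return [s + rng.gauss(0.0, noise_level) for s in signal]


def extract_features(signal):
    """Toy handcrafted features: RMS energy and zero-crossing rate."""
    rms = math.sqrt(sum(s * s for s in signal) / len(signal))
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a < 0) != (b < 0))
    return [rms, crossings / (len(signal) - 1)]


def fit_centroids(features, labels):
    """Nearest-centroid stand-in for the classifier: mean vector per label."""
    sums, counts = {}, {}
    for f, y in zip(features, labels):
        acc = sums.setdefault(y, [0.0] * len(f))
        for i, v in enumerate(f):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(centroids, f):
    """Return the label whose centroid is closest in squared distance."""
    return min(
        centroids,
        key=lambda y: sum((a - b) ** 2 for a, b in zip(f, centroids[y])),
    )


def make_corpus(rng, n_per_class, gain):
    """Synthetic 'corpus': two classes as sine tones; gain mimics corpus mismatch."""
    classes = {"calm": (3.0, 0.3), "angry": (12.0, 1.0)}  # (cycles, amplitude)
    signals, labels = [], []
    for label, (cycles, amp) in classes.items():
        for _ in range(n_per_class):
            phase = rng.uniform(0.0, 2.0 * math.pi)
            signals.append([
                gain * amp * math.sin(2.0 * math.pi * cycles * t / 200 + phase)
                for t in range(200)
            ])
            labels.append(label)
    return signals, labels


def cross_corpus_accuracy(seed=0):
    """Train on corpus A (with augmentation), test on unseen corpus B."""
    rng = random.Random(seed)
    train_signals, train_labels = make_corpus(rng, 20, gain=1.0)
    test_signals, test_labels = make_corpus(rng, 20, gain=0.8)

    # Augmentation is applied to the training corpus only.
    augmented = [augment_with_noise(s, 0.02, rng) for s in train_signals]
    train_x = [extract_features(s) for s in train_signals + augmented]
    train_y = train_labels + train_labels

    centroids = fit_centroids(train_x, train_y)
    preds = [predict(centroids, extract_features(s)) for s in test_signals]
    return sum(p == y for p, y in zip(preds, test_labels)) / len(test_labels)


if __name__ == "__main__":
    print(f"cross-corpus accuracy: {cross_corpus_accuracy():.2f}")
```

The key design point is that the test corpus never contributes to training or augmentation, so the reported accuracy reflects how well the model transfers across corpora rather than how well it fits a single one.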
License
Copyright in a work is a bundle of rights. IJoSI's copyright covers what may be done with the work in terms of making copies, making derivative works, abstracting parts of it for citation or quotation elsewhere, and so on. IJoSI requires authors to sign over rights when their article is ready for publication so that the publisher from then on owns the work. Until that point, all rights belong to the creator(s) of the work. The IJoSI copyright form can be found on the IJoSI website.
The issues of the International Journal of Systematic Innovation (IJoSI) are published in electronic format and in print. Our website, journal papers, manuscripts, etc. are stored on one server. Readers have free online access to our journal papers. Authors transfer copyright to the publisher as part of a journal publishing agreement, but have the right to:
1. Share their article for personal use, internal institutional use and scholarly sharing purposes, with a DOI link to the version of record on our server.
2. Retain patent, trademark and other intellectual property rights (including research data).
3. Receive proper attribution and credit for the published work.