Hybrid Feature Extraction Technique with Data Augmentation for Speech Emotion Recognition Using Deep Learning
Abstract
Speech is one of the most common and fastest forms of human communication, and the prediction of emotions from human speech by machines is termed speech emotion recognition (SER). SER remains a challenging task: various deep learning algorithms aim to give machines the learning capability required for it, and although considerable research has been conducted in this area, correctly identifying emotions from human speech is still difficult. The SER process consists of three main stages: feature extraction, feature selection, and classification, of which feature extraction is considered the most significant. Most prior work has relied on a single feature extraction technique for training the model. In this paper we present a hybrid feature extraction technique with data augmentation for an effective emotion recognition model. Simulations are performed on the TESS dataset. Two sets of features are extracted from the speech data: in the first technique, mel-frequency cepstral coefficients (MFCCs) are used as the feature set, and in the second, mel spectrogram images are extracted. Data augmentation by noise addition is proposed to increase the number of samples. A convolutional neural network (CNN) is then trained and tested to achieve better accuracy. The proposed hybrid feature set with data augmentation achieves an accuracy of 95.21%.