EN
A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech
Abstract
Speech, which is one of the most effective methods of communication, varies according to the emotions experienced by people and includes not only vocabulary but also information about emotions. With developing technologies, human-machine interaction is also improving. Emotional information to be extracted from voice signals is valuable for this interaction. For these reasons, studies on emotion recognition systems are increasing. In this study, sentiment analysis is performed using the Toronto Emotional Speech Set (TESS) created by University of Toronto. The voice data in the dataset is first preprocessed and then a new CNN-based deep learning method on it is compared. The voice files in the TESS dataset have been first obtained feature maps using the MFCC method, and then classification has been performed with this method based on the proposed neural network model. Separate models have been created with CNN and LSTM models for the classification process. The experiments show that the MFCC-applied CNN model achieves a better result with an accuracy of 99.5% than the existing methods for the classification of voice signals. The accuracy value of the CNN model shows that the proposed CNN model can be used for emotion classification from human voice data.
Keywords
References
- 1. Liu, K., Wang, D., Wu, D., Liu, Y., and Feng, J., Speech emotion recognition via multi-level attention network. IEEE Signal Processing Letters, 2022. 29: p. 2278-2282.
- 2. Venkataramanan, K., and Rajamohan, H. R., Emotion recognition from speech. arXiv preprint, 2019. p. 1912-10458.
- 3. Aydin, M., Tuğrul, B., and Yilmaz, A. R., Emotion Recognition System from Speech using Convolutional Neural Networks. Computer Science, 2022. p. 137-143.
- 4. Xu, Y., English speech recognition and evaluation of pronunciation quality using deep learning. Mobile Information Systems, 2022. p. 1-12.
- 5. Donuk, K., and Hanbay, D., Konuşma Duygu Tanıma için Akustik Özelliklere Dayalı LSTM Tabanlı Bir Yaklaşım. Computer Science, 2022. 7(2): p. 54-67.
- 6. Akinpelu, S., and Viriri, S., Robust Feature Selection-Based Speech Emotion Classification Using Deep Transfer Learning. Applied Sciences, 2022. 12(16): 8265.
- 7. Patel, N., Patel, S., and Mankad, S. H., Impact of autoencoder based compact representation on emotion detection from audio. Journal of Ambient Intelligence and Humanized Computing, 2022. p. 1-19.
- 8. Asiya, U. A., and Kiran, V. K., Speech Emotion Recognition-A Deep Learning Approach, in Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC). 2021. Palladam, India: p. 867-871.
Details
Primary Language
English
Subjects
Computer Software, Electrical Engineering (Other)
Journal Section
Research Article
Early Pub Date
June 5, 2024
Publication Date
April 20, 2024
Submission Date
October 9, 2023
Acceptance Date
March 14, 2024
Published in Issue
Year 2024 Volume: 8 Number: 1
APA
Şengül, F., & Akkaya, S. (2024). A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech. International Advanced Researches and Engineering Journal, 8(1), 33-42. https://doi.org/10.35860/iarej.1373333
AMA
1.Şengül F, Akkaya S. A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech. Int. Adv. Res. Eng. J. 2024;8(1):33-42. doi:10.35860/iarej.1373333
Chicago
Şengül, Fatih, and Sıtkı Akkaya. 2024. “A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech”. International Advanced Researches and Engineering Journal 8 (1): 33-42. https://doi.org/10.35860/iarej.1373333.
EndNote
Şengül F, Akkaya S (April 1, 2024) A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech. International Advanced Researches and Engineering Journal 8 1 33–42.
IEEE
[1]F. Şengül and S. Akkaya, “A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech”, Int. Adv. Res. Eng. J., vol. 8, no. 1, pp. 33–42, Apr. 2024, doi: 10.35860/iarej.1373333.
ISNAD
Şengül, Fatih - Akkaya, Sıtkı. “A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech”. International Advanced Researches and Engineering Journal 8/1 (April 1, 2024): 33-42. https://doi.org/10.35860/iarej.1373333.
JAMA
1.Şengül F, Akkaya S. A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech. Int. Adv. Res. Eng. J. 2024;8:33–42.
MLA
Şengül, Fatih, and Sıtkı Akkaya. “A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech”. International Advanced Researches and Engineering Journal, vol. 8, no. 1, Apr. 2024, pp. 33-42, doi:10.35860/iarej.1373333.
Vancouver
1.Fatih Şengül, Sıtkı Akkaya. A Modified MFCC-Based Deep Learning Method for Emotion Classification from Speech. Int. Adv. Res. Eng. J. 2024 Apr. 1;8(1):33-42. doi:10.35860/iarej.1373333
Cited By
Merger of internet of things and machine learning: The internet of everything sector projects, benefits, and future roles
International Advanced Researches and Engineering Journal
https://doi.org/10.35860/iarej.1467110Enhancing water quality prediction with artificial intelligence: A hybrid convlstm model for spatio-temporal analysis
International Advanced Researches and Engineering Journal
https://doi.org/10.35860/iarej.1679575
