Araştırma Makalesi

Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study

Cilt: 29 Sayı: 4 21 Nisan 2026
PDF İndir
TR EN

Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study

Öz

With advancements in artificial intelligence (AI), particularly in pattern recognition, significant progress has been made in recognising human emotions from speech characteristics, facial activity, and physiological responses. However, the expansion of Internet of Things (IoT)-based infrastructures has increased pressure on conventional cloud systems due to the high volume of transmitted data and the need for real-time responsiveness. As a remedy, edge computing has emerged as a distributed alternative, enabling localised data processing and reducing dependency on remote servers. In this context, the present study evaluates the classification performance of three hybrid deep learning (DL) models—Convolutional Neural Network–Dense Neural Network (CNN-Dense), Long Short-Term Memory–Convolutional Neural Network (LSTM-CNN), and Dense–Long Short-Term Memory (Dense-LSTM) —within a simulated edge-based environment. The Toronto Emotional Speech Set (TESS) dataset was employed, and experimental workflows were implemented via Amazon Web Services (AWS) to simulate edge resource limitations. Accuracy was assessed using macro-averaged metrics, including precision, recall, and F1-score. Among the models, CNN-Dense showed the highest performance, achieving an F1-score of 96%, followed by LSTM-CNN (95%) and Dense-LSTM (93%). The findings suggest that CNN–Dense may offer feature extraction advantages, and that hybrid models could be promising for emotion classification in decentralised systems.

Anahtar Kelimeler

Kaynakça

  1. [1] Bonomi, F., Milito, R., Zhu, J., Addepalli, S., “Fog computing and its role in the internet of things” In Proceedings of the first edition of the MCC workshop on Mobile cloud computing (pp. 13-16), (2012).
  2. [2] Al-Turjman, F.,”Intelligence and security in big 5G-oriented IoNT: An overview”, Future Generation Computer Systems, 102, 357-368, (2022).
  3. [3] Mavaddati, S.,” Voice-based age, gender, and language recognition based on ResNet deep model and transfer learning in spectro-temporal domain” Neurocomputing, 580, 127429, (2024).
  4. [4] Nonis, F., Dagnes, N., Marcolin, F., Vezzetti, E., “3D approaches and challenges in facial expression recognition algorithms-a literature review” Applied Sciences, 9(18), 3904 ,(2019).
  5. [5] Vasdev, D., Gupta, V., Shubham, S., Chaudhary, A., Jain, N., Salimi, M., Ahmadian, A.,” Periapical dental X-ray image classification using deep neural networks“, Annals of Operations Research, (2022).
  6. [6] Yang, Z., Li, Z., Zhou, S., Zhang, L., Serikawa, S., “Speech emotion recognition based on multi-feature speed rate and LSTM”, Neurocomputing, 601, 128177, (2024).
  7. [7] Pitaloka, D. A., Wulandari, A., Basaruddin, T., Liliana, D. Y.,” Enhancing CNN with preprocessing stage in automatic emotion recognition” Procedia computer science, 116, 523-529, (2017).
  8. [8] Ng, H. W., Nguyen, V. D., Vonikakis, V., Winkler, S.,” Deep learning for emotion recognition on small datasets using transfer learning” In Proceedings of the 2015 ACM on international conference on multimodal interaction (pp. 443-449), (2015).

Ayrıntılar

Birincil Dil

İngilizce

Konular

Derin Öğrenme, Konuşma Tanıma

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

28 Eylül 2025

Yayımlanma Tarihi

21 Nisan 2026

Gönderilme Tarihi

30 Haziran 2025

Kabul Tarihi

12 Eylül 2025

Yayımlandığı Sayı

Yıl 2026 Cilt: 29 Sayı: 4

Kaynak Göster

APA
İşler, B. (2026). Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study. Politeknik Dergisi, 29(4), 1-13. https://doi.org/10.2339/politeknik.1729678
AMA
1.İşler B. Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study. Politeknik Dergisi. 2026;29(4):1-13. doi:10.2339/politeknik.1729678
Chicago
İşler, Buket. 2026. “Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study”. Politeknik Dergisi 29 (4): 1-13. https://doi.org/10.2339/politeknik.1729678.
EndNote
İşler B (01 Nisan 2026) Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study. Politeknik Dergisi 29 4 1–13.
IEEE
[1]B. İşler, “Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study”, Politeknik Dergisi, c. 29, sy 4, ss. 1–13, Nis. 2026, doi: 10.2339/politeknik.1729678.
ISNAD
İşler, Buket. “Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study”. Politeknik Dergisi 29/4 (01 Nisan 2026): 1-13. https://doi.org/10.2339/politeknik.1729678.
JAMA
1.İşler B. Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study. Politeknik Dergisi. 2026;29:1–13.
MLA
İşler, Buket. “Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study”. Politeknik Dergisi, c. 29, sy 4, Nisan 2026, ss. 1-13, doi:10.2339/politeknik.1729678.
Vancouver
1.Buket İşler. Deep Learning-Based Speech Emotion Recognition for IoT Edge Devices: A Comparative Study. Politeknik Dergisi. 01 Nisan 2026;29(4):1-13. doi:10.2339/politeknik.1729678
 
TARANDIĞIMIZ DİZİNLER (ABSTRACTING / INDEXING)
181341319013191 13189 13187 13188 18016 

download Bu eser Creative Commons Atıf-AynıLisanslaPaylaş 4.0 Uluslararası ile lisanslanmıştır.