Research Article

Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal

Volume: 9 Number: 3 June 30, 2026
EN

Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal

Abstract

This research work aims to enhance speech recognition accuracy and system generalization performance by optimizing deep neural network (DNN) Systems. The Experiments are conducted using a standard benchmark speech dataset and an independent real-time speech dataset while following a complete speaker-independent assessment method. The baseline model uses a feed-forward DNN, which researchers improve through Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) and Whale Optimization Algorithm (WOA), and the proposed Neural Whale Optimization Algorithm (NOWOA). Comprehensive evaluations, including confusion matrix-based metrics, 5-fold cross-validation, and overfitting analysis, are performed to assess robustness and reliability. Experimental results demonstrate that the baseline DNN achieves approximately 50\% recognition accuracy, while optimization significantly enhances performance. The proposed NOWOA-optimized DNN system achieves the highest recognition accuracy of 99.36\% among all tested methods, proving its effectiveness for speech recognition tasks on both standard and real-time datasets.

Keywords

Ethical Statement

It is declared that during the preparation process of this study, scientific and ethical principles were followed.

References

  1. L. S. Hussein and S. A. Mahmood, Kurdish Speech-to-Text Recognition System Based on Deep Convolutional-Recurrent Neural Networks, UHD Journal of Science and Technology, vol. 6, no. 2, pp. 117–125, 2022. doi:10.21928/uhdjst.v6n2y2022.pp117-125.
  2. D. Dutta, R. D. Choudhury, and U. Barman, Assamese Speech-Based Terminology Identification System using Convolutional Neural Network, International Journal of Computing and Digital Systems, vol. 12, no. 1, pp. 1191–1202, 2022. doi:10.12785/ijcds/120195.
  3. M. Al Dabel, Speech Attribute Detection to Recognize Arabic Broadcast Speech in Industrial Networks, Mobile Information Systems, vol. 2022, pp. 1–13, 2022. doi:10.1155/2022/3732442.
  4. M. K. Singh, Speaker Identification using MFCC Feature Extraction ANN Classification Technique, Wireless Personal Communications, vol. 136, no. 1, pp. 453–467, 2024. doi:10.1007/s11277-024-11282-1.
  5. S. S. Nagineni, K. C. Krishna, M. Harish, and B. Nayak, Implementation of Supervised Speech Enhancement Model Using Fully Connected Feed-Forward Networks, International Journal of Innovative Science, Engineering & Technology, vol. 9, pp. 2348–7968, 2022.
  6. Y. Choi, J. Jang, and M. W. Koo, A Korean Menu-Ordering Sentence Text-to-Speech System Using Conformer-Based FastSpeech2, The Journal of the Acoustical Society of Korea, vol. 41, no. 3, pp. 359–366, 2022. doi:10.7776/ASK.2022.41.3.359.
  7. M. K. Singh, Identification of Speaker from Disguised Voice using MFCC Feature Extraction, Chi-Square and Classification Technique, Wireless Personal Communications, vol. 138, no. 2, pp. 973–987, 2024. doi:10.1007/s11277-024-11542-0.
  8. G. Chakraborty, M. Sharma, N. Saikia, and K. K. Sarma, Soft-Computation-Based Speech Recognition System for Sylheti Language, International Journal of Speech Technology, vol. 25, no. 2, pp. 499–509, 2022. doi:10.1007/s10772-022-09976-7.

Details

Primary Language

English

Subjects

Artificial Intelligence (Other)

Journal Section

Research Article

Early Pub Date

June 25, 2026

Publication Date

June 30, 2026

Submission Date

November 8, 2025

Acceptance Date

March 30, 2026

Published in Issue

Year 2026 Volume: 9 Number: 3

APA
Singh, M. K., Kumar, S., & Ranjan, R. (2026). Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal. Sakarya University Journal of Computer and Information Sciences, 9(3), 920-933. https://doi.org/10.35377/saucis...1819908
AMA
1.Singh MK, Kumar S, Ranjan R. Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal. SAUCIS. 2026;9(3):920-933. doi:10.35377/saucis.1819908
Chicago
Singh, Mahesh K., Sanjeev Kumar, and Rajeev Ranjan. 2026. “Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal”. Sakarya University Journal of Computer and Information Sciences 9 (3): 920-33. https://doi.org/10.35377/saucis. 1819908.
EndNote
Singh MK, Kumar S, Ranjan R (June 1, 2026) Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal. Sakarya University Journal of Computer and Information Sciences 9 3 920–933.
IEEE
[1]M. K. Singh, S. Kumar, and R. Ranjan, “Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal”, SAUCIS, vol. 9, no. 3, pp. 920–933, June 2026, doi: 10.35377/saucis...1819908.
ISNAD
Singh, Mahesh K. - Kumar, Sanjeev - Ranjan, Rajeev. “Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal”. Sakarya University Journal of Computer and Information Sciences 9/3 (June 1, 2026): 920-933. https://doi.org/10.35377/saucis. 1819908.
JAMA
1.Singh MK, Kumar S, Ranjan R. Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal. SAUCIS. 2026;9:920–933.
MLA
Singh, Mahesh K., et al. “Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal”. Sakarya University Journal of Computer and Information Sciences, vol. 9, no. 3, June 2026, pp. 920-33, doi:10.35377/saucis. 1819908.
Vancouver
1.Mahesh K. Singh, Sanjeev Kumar, Rajeev Ranjan. Feed-Forward Deep Neural Network Model Based Speech Recognition System for Speech Signal. SAUCIS. 2026 Jun. 1;9(3):920-33. doi:10.35377/saucis. 1819908

 

INDEXING & ABSTRACTING & ARCHIVING

 

31045 31044   ResimLink - Resim Yükle  31047 

31043 28939 28938 34240
 

 

29070    The papers in this journal are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License