Research Article

Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance

Volume: 8 Number: 1 January 17, 2025
TR EN

Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance

Abstract

This study investigates the effects of audio processing and filtering strategies to enhance the performance of speech recognition systems in noisy environments. The focus is on the Short-Time Fourier Transform (STFT) operations applied to noisy audio files and noise reduction procedures. While STFT operations form the basis for detecting noise and analyzing the speech signal in the frequency domain, noise reduction steps involve threshold-based masking and convolution operations. The results obtained demonstrate a significant improvement in speech recognition accuracy in noisy environments through audio processing and filtering strategies. A detailed analysis of the graphs provides guidance for evaluating the effectiveness of noise reduction procedures and serves as a roadmap for future research. This study emphasizes the critical importance of audio processing and filtering strategies in improving the performance of speech recognition systems in noisy environments, laying a foundation for future studies.

Keywords

References

  1. Ali MH., Jaber MM., Abd SK., Rehman A., Awan MJ., Vitkutė-Adžgauskienė D., Damaševičius R., Bahaj SA. Harris hawks sparse auto-encoder networks for automatic speech recognition system. Applied Sciences 2022; 12(3): 1091.
  2. Anggriawan DO., Wahjono E., Sudiharto I., Firdaus AA., Putri DNN., Budikarso A. Identification of short duration voltage variations based on short time Fourier transform and artificial neural network. 2020 International Electronics Symposium 2020; 43-47.
  3. Bharti D., Kukana P. A hybrid machine learning model for emotion recognition from speech signals. International Conference on Smart Electronics and Communication (ICOSEC) 2020; 491-496.
  4. Garg K., Jain G. A comparative study of noise reduction techniques for automatic speech recognition systems. International Conference on Advances in Computing, Communications and Informatics (ICACCI) 2016; 2098-2103.
  5. Hamidi M., Satori H., Zealouk O., Satori K. Amazigh digits through interactive speech recognition system in noisy environment. International Journal of Speech Technology 2020; 23(1): 101-109.
  6. Hazrati A., Eftekhari A., Taherian S. A Novel Speech Enhancement Method Based on Deep Residual Network in Low SNR Conditions. 7th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) 2019; 72-76.
  7. Jurado F., Saenz JR. Comparison between discrete STFT and wavelets for the analysis of power quality events. Electric Power Systems Research 2002; 62(3): 183-190.
  8. Kalamani M., Krishnamoorthi M. Modified least mean square adaptive filter for speech enhancement. Applied Speech Processing 2021; 47-73.

Details

Primary Language

English

Subjects

Deep Learning, Machine Learning (Other)

Journal Section

Research Article

Early Pub Date

January 15, 2025

Publication Date

January 17, 2025

Submission Date

March 22, 2024

Acceptance Date

August 29, 2024

Published in Issue

Year 2025 Volume: 8 Number: 1

APA
Özkurt, C. (2025). Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 8(1), 222-247. https://doi.org/10.47495/okufbed.1457532
AMA
1.Özkurt C. Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance. Osmaniye Korkut Ata University Journal of The Institute of Science and Techno. 2025;8(1):222-247. doi:10.47495/okufbed.1457532
Chicago
Özkurt, Cem. 2025. “Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 8 (1): 222-47. https://doi.org/10.47495/okufbed.1457532.
EndNote
Özkurt C (January 1, 2025) Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 8 1 222–247.
IEEE
[1]C. Özkurt, “Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance”, Osmaniye Korkut Ata University Journal of The Institute of Science and Techno, vol. 8, no. 1, pp. 222–247, Jan. 2025, doi: 10.47495/okufbed.1457532.
ISNAD
Özkurt, Cem. “Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 8/1 (January 1, 2025): 222-247. https://doi.org/10.47495/okufbed.1457532.
JAMA
1.Özkurt C. Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance. Osmaniye Korkut Ata University Journal of The Institute of Science and Techno. 2025;8:222–247.
MLA
Özkurt, Cem. “Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, vol. 8, no. 1, Jan. 2025, pp. 222-47, doi:10.47495/okufbed.1457532.
Vancouver
1.Cem Özkurt. Investigation of the Effectiveness of Audio Processing and Filtering Strategies in Noisy Environments on Speech Recognition Performance. Osmaniye Korkut Ata University Journal of The Institute of Science and Techno. 2025 Jan. 1;8(1):222-47. doi:10.47495/okufbed.1457532

23487


196541947019414

19433194341943519436 1960219721 197842261021238 23877

*This journal is an international refereed journal 

*Our journal does not charge any article processing fees over publication process.

* This journal is online publishes 5 issues per year (January, March, June, September, December)

*This journal published in Turkish and English as open access. 

19450 This work is licensed under a Creative Commons Attribution 4.0 International License.