Research Article

A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems

Number: 24 April 15, 2021

Abstract

Speech recognition is the conversion of spoken words and sentences into text. It has recently been studied extensively in many countries. In Turkey, however, studies on speech recognition applications remain scarce, and one reason is the lack of speech datasets. In this study, a Turkish speech database was developed for Turkish speech recognition systems. The recordings were obtained from news programs broadcast by Turkish television news channels at different times. The resulting dataset has been made publicly available on the web so that it can serve as an example for other studies. In addition, the effects of two hyperparameters, the number of layers and the number of cells, of Long Short-Term Memory (LSTM) and Deep Neural Network (DNN) models were investigated on the Turkish Broadcast News Speech Database.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

April 15, 2021

Submission Date

March 22, 2021

Acceptance Date

April 5, 2021

Published in Issue

Year 2021 Number: 24

APA
Ok, S., & Tüfekci, Z. (2021). A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. Avrupa Bilim Ve Teknoloji Dergisi, 24, 87-92. https://doi.org/10.31590/ejosat.900422
