EN
TR
A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems
Abstract
Speech recognition is the transformation of spoken words and sentences into text. There have been many studies on speech recognition in many countries recently. However, studies on speech recognition applications in our country are very few, one of the reasons is the lack of voice dataset. In this study, a Turkish speech database has been developed for Turkish speech recognition based systems. Sound recordings were obtained from news broadcasted by Turkish news tv channels at different times. The created data set was shared on the web in a way that everyone can access in order to set a precedent for other studies. Additionally, the effects of number of layers and number of cells hyperparameters of Long Short Term Memory (LSTM) and Deep Neural Network (DNN) models were investigated on the Turkish Broadcast News Speech Database.
Keywords
References
- Bengio, Y., 2009. "Learning Deep Architectures for AI" (PDF). Foundations and Trends in Machine Learning. 1–127.
- Gaikwad, S., Gawali, B. W., & Yannawar, P. 2010. A review on Speech Recognition Technique. , pp. 16-24
- Graves, A., Mohamed, A. R., & Hinton, G. (2013, May). Speech recognition with deep recurrent neural networks. In 2013 IEEE international conference on acoustics, speech and signal processing (pp. 6645-6649). IEEE.
- Graves, A., Jaitly, N., & Mohamed, A. R. (2013b, December). Hybrid speech recognition with deep bidirectional LSTM. In 2013 IEEE workshop on automatic speech recognition and understanding (pp. 273-278). IEEE.
- Hizlisoy, S., 2020. Music Emotion Recognition Using Convolutional Long Short Memory Deep Neural Networks.
- Patlar, F., 2009. A Continuous Speech Recognition System For Turkish Language Based On Triphone Model.
- Sepp Hochreiter; Jürgen Schmidhuber (1997). "LSTM can Solve Hard Long Time Lag Problems". Advances in Neural Information Processing Systems 9. Advances in Neural Information Processing Systems. Wikidata Q77698282.
- Tüfekci, Z., and Dokuz, Y., 2020. Investigation of the Effect of LSTM Hyperparameters on Speech Recognition Performance , European Journal of Science and Technology: p. 165.
Details
Primary Language
English
Subjects
Engineering
Journal Section
Research Article
Publication Date
April 15, 2021
Submission Date
March 22, 2021
Acceptance Date
April 5, 2021
Published in Issue
Year 2021 Number: 24
APA
Ok, S., & Tüfekci, Z. (2021). A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. Avrupa Bilim Ve Teknoloji Dergisi, 24, 87-92. https://doi.org/10.31590/ejosat.900422
AMA
1.Ok S, Tüfekci Z. A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. EJOSAT. 2021;(24):87-92. doi:10.31590/ejosat.900422
Chicago
Ok, Serhat, and Zekeriya Tüfekci. 2021. “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”. Avrupa Bilim Ve Teknoloji Dergisi, nos. 24: 87-92. https://doi.org/10.31590/ejosat.900422.
EndNote
Ok S, Tüfekci Z (April 1, 2021) A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. Avrupa Bilim ve Teknoloji Dergisi 24 87–92.
IEEE
[1]S. Ok and Z. Tüfekci, “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”, EJOSAT, no. 24, pp. 87–92, Apr. 2021, doi: 10.31590/ejosat.900422.
ISNAD
Ok, Serhat - Tüfekci, Zekeriya. “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”. Avrupa Bilim ve Teknoloji Dergisi. 24 (April 1, 2021): 87-92. https://doi.org/10.31590/ejosat.900422.
JAMA
1.Ok S, Tüfekci Z. A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. EJOSAT. 2021;:87–92.
MLA
Ok, Serhat, and Zekeriya Tüfekci. “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”. Avrupa Bilim Ve Teknoloji Dergisi, no. 24, Apr. 2021, pp. 87-92, doi:10.31590/ejosat.900422.
Vancouver
1.Serhat Ok, Zekeriya Tüfekci. A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. EJOSAT. 2021 Apr. 1;(24):87-92. doi:10.31590/ejosat.900422
Cited By
VPSA-Based Transfer Function Identification of Single DoF Copter System
International Journal of Aviation Science and Technology
https://doi.org/10.23890/IJAST.vm04is02.0204