EN
TR
A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems
Öz
Speech recognition is the transformation of spoken words and sentences into text. There have been many studies on speech recognition in many countries recently. However, studies on speech recognition applications in our country are very few, one of the reasons is the lack of voice dataset. In this study, a Turkish speech database has been developed for Turkish speech recognition based systems. Sound recordings were obtained from news broadcasted by Turkish news tv channels at different times. The created data set was shared on the web in a way that everyone can access in order to set a precedent for other studies. Additionally, the effects of number of layers and number of cells hyperparameters of Long Short Term Memory (LSTM) and Deep Neural Network (DNN) models were investigated on the Turkish Broadcast News Speech Database.
Anahtar Kelimeler
Kaynakça
- Bengio, Y., 2009. "Learning Deep Architectures for AI" (PDF). Foundations and Trends in Machine Learning. 1–127.
- Gaikwad, S., Gawali, B. W., & Yannawar, P. 2010. A review on Speech Recognition Technique. , pp. 16-24
- Graves, A., Mohamed, A. R., & Hinton, G. (2013, May). Speech recognition with deep recurrent neural networks. In 2013 IEEE international conference on acoustics, speech and signal processing (pp. 6645-6649). IEEE.
- Graves, A., Jaitly, N., & Mohamed, A. R. (2013b, December). Hybrid speech recognition with deep bidirectional LSTM. In 2013 IEEE workshop on automatic speech recognition and understanding (pp. 273-278). IEEE.
- Hizlisoy, S., 2020. Music Emotion Recognition Using Convolutional Long Short Memory Deep Neural Networks.
- Patlar, F., 2009. A Continuous Speech Recognition System For Turkish Language Based On Triphone Model.
- Sepp Hochreiter; Jürgen Schmidhuber (1997). "LSTM can Solve Hard Long Time Lag Problems". Advances in Neural Information Processing Systems 9. Advances in Neural Information Processing Systems. Wikidata Q77698282.
- Tüfekci, Z., and Dokuz, Y., 2020. Investigation of the Effect of LSTM Hyperparameters on Speech Recognition Performance , European Journal of Science and Technology: p. 165.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
15 Nisan 2021
Gönderilme Tarihi
22 Mart 2021
Kabul Tarihi
5 Nisan 2021
Yayımlandığı Sayı
Yıl 2021 Sayı: 24
APA
Ok, S., & Tüfekci, Z. (2021). A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. Avrupa Bilim ve Teknoloji Dergisi, 24, 87-92. https://doi.org/10.31590/ejosat.900422
AMA
1.Ok S, Tüfekci Z. A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. EJOSAT. 2021;(24):87-92. doi:10.31590/ejosat.900422
Chicago
Ok, Serhat, ve Zekeriya Tüfekci. 2021. “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”. Avrupa Bilim ve Teknoloji Dergisi, sy 24: 87-92. https://doi.org/10.31590/ejosat.900422.
EndNote
Ok S, Tüfekci Z (01 Nisan 2021) A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. Avrupa Bilim ve Teknoloji Dergisi 24 87–92.
IEEE
[1]S. Ok ve Z. Tüfekci, “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”, EJOSAT, sy 24, ss. 87–92, Nis. 2021, doi: 10.31590/ejosat.900422.
ISNAD
Ok, Serhat - Tüfekci, Zekeriya. “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”. Avrupa Bilim ve Teknoloji Dergisi. 24 (01 Nisan 2021): 87-92. https://doi.org/10.31590/ejosat.900422.
JAMA
1.Ok S, Tüfekci Z. A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. EJOSAT. 2021;:87–92.
MLA
Ok, Serhat, ve Zekeriya Tüfekci. “A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems”. Avrupa Bilim ve Teknoloji Dergisi, sy 24, Nisan 2021, ss. 87-92, doi:10.31590/ejosat.900422.
Vancouver
1.Serhat Ok, Zekeriya Tüfekci. A Turkish Broadcast News Speech Database for Investigation the Effect of Deep Neural Network and Long Short Term Memory Hyperparameters on Speech Recognition Based Systems. EJOSAT. 01 Nisan 2021;(24):87-92. doi:10.31590/ejosat.900422
Cited By
VPSA-Based Transfer Function Identification of Single DoF Copter System
International Journal of Aviation Science and Technology
https://doi.org/10.23890/IJAST.vm04is02.0204