EN
Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size
Öz
In the study, a syllable-scale synchronization study was carried out by considering the grammatical structure of Turkish to emphasize simultaneously the sound and the text. Therefore, it was aimed to classify the vowels and consonants in Turkish within the word. For this purpose, two different Artificial Neural Network (ANN) models were preferred for this classification, and also the Mel-Frequency Cepstrum Coefficients method was preferred for extracting features of voice data. It has been observed that ANNs give the best results with deep learning. Tests were made with different numbers of coefficients in feature extraction. In the first stage of this study, a certain number of recordings were taken from the vowels and consonants in Turkish. Then, their feature was extracted and prepared for the training of networks. The best network structure and parameters were selected as a result of training and test made with different parameters. In this training, networks were asked to distinguish vowels from consonants. Afterward, the vowel-consonant distinction was made among 10 predetermined vectors of words and phrases. Layer-recurrent Neural Network and Pattern Recognition Network achieved an average success of 97.43% and 98.04%, respectively, in deep learning training carried out through the Mathworks Matlab software. Because Pattern Recognition Network achieved 98.82% success in recognizing vowels and 97.27% in recognizing consonants, this network model was preferred in vowel-consonant classification. After the classification process, timing files were created by determining the transition times of the vowels in the word. In the last step, an interface was created on the C# .NET platform for the synchronization process, and a syllabic algorithm was developed in this interface to emphasize the syllable synchronization of the text. Thus, the desired high precision was achieved in the simultaneous highlighting of the words.
Anahtar Kelimeler
Kaynakça
- Bayat, H.İ., 2020. Identification of vowel-non vowel letter with artificial neural network and sound-text synchronization at syllable level (Master thesis), Gaziosmanpaşa University, Institute of science and technology, Tokat, Turkey.
- Bengio, Y., 2012. Deep learning of representations for unsupervised and transfer learning. In Proceedings of ICML workshop on unsupervised and transfer learning, pp. 17-36.
- Cakir, E., 2014. Multilabel sound event classification with neural networks. (Master thesis), Tampere University of Technology, Faculty of Computing and Electrical Engineering, Finland.
- Çakır, M.Y., 2017. Real-time high-quality voice recognition. (Master thesis), İstanbul Sabahattin Zaim University, Institute of science and technology, İstanbul, Turkey
- Cosi, P., Bengua, Y. and De Maria, R., 1990. Phonetically-based multi-layered neural networks for vowel classification. Speech Communication, 1(9), pp. 15-19.
- Dave, N., 2013. Feature extraction methods LPC, PLP and MFCC in speech recognition. Internatıonal journal for advance research ın engıneerıng and technology, 4(1), 5 pp.
- Dede, G., 2008. Speech recognition with artificial neural networks (Master thesis), Ankara University, Institute of science and technology, Ankara, Turkey.
- Elman, L. J., 1990. Finding structure in time. Cognitive Science, 2(14), pp. 179-211.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Bilgisayar Yazılımı, Mühendislik
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
1 Mart 2022
Gönderilme Tarihi
17 Eylül 2021
Kabul Tarihi
18 Kasım 2021
Yayımlandığı Sayı
Yıl 2022 Cilt: 12 Sayı: 1
APA
Onder, M., & Bayat, H. İ. (2022). Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Journal of the Institute of Science and Technology, 12(1), 41-57. https://doi.org/10.21597/jist.957879
AMA
1.Onder M, Bayat Hİ. Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Iğdır Üniv. Fen Bil Enst. Der. 2022;12(1):41-57. doi:10.21597/jist.957879
Chicago
Onder, Mursel, ve Halil İbrahim Bayat. 2022. “Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”. Journal of the Institute of Science and Technology 12 (1): 41-57. https://doi.org/10.21597/jist.957879.
EndNote
Onder M, Bayat Hİ (01 Mart 2022) Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Journal of the Institute of Science and Technology 12 1 41–57.
IEEE
[1]M. Onder ve H. İ. Bayat, “Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”, Iğdır Üniv. Fen Bil Enst. Der., c. 12, sy 1, ss. 41–57, Mar. 2022, doi: 10.21597/jist.957879.
ISNAD
Onder, Mursel - Bayat, Halil İbrahim. “Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”. Journal of the Institute of Science and Technology 12/1 (01 Mart 2022): 41-57. https://doi.org/10.21597/jist.957879.
JAMA
1.Onder M, Bayat Hİ. Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Iğdır Üniv. Fen Bil Enst. Der. 2022;12:41–57.
MLA
Onder, Mursel, ve Halil İbrahim Bayat. “Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”. Journal of the Institute of Science and Technology, c. 12, sy 1, Mart 2022, ss. 41-57, doi:10.21597/jist.957879.
Vancouver
1.Mursel Onder, Halil İbrahim Bayat. Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Iğdır Üniv. Fen Bil Enst. Der. 01 Mart 2022;12(1):41-57. doi:10.21597/jist.957879