Research Article

Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size

Volume: 12 Number: 1 March 1, 2022
EN

Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size

Abstract

In the study, a syllable-scale synchronization study was carried out by considering the grammatical structure of Turkish to emphasize simultaneously the sound and the text. Therefore, it was aimed to classify the vowels and consonants in Turkish within the word. For this purpose, two different Artificial Neural Network (ANN) models were preferred for this classification, and also the Mel-Frequency Cepstrum Coefficients method was preferred for extracting features of voice data. It has been observed that ANNs give the best results with deep learning. Tests were made with different numbers of coefficients in feature extraction. In the first stage of this study, a certain number of recordings were taken from the vowels and consonants in Turkish. Then, their feature was extracted and prepared for the training of networks. The best network structure and parameters were selected as a result of training and test made with different parameters. In this training, networks were asked to distinguish vowels from consonants. Afterward, the vowel-consonant distinction was made among 10 predetermined vectors of words and phrases. Layer-recurrent Neural Network and Pattern Recognition Network achieved an average success of 97.43% and 98.04%, respectively, in deep learning training carried out through the Mathworks Matlab software. Because Pattern Recognition Network achieved 98.82% success in recognizing vowels and 97.27% in recognizing consonants, this network model was preferred in vowel-consonant classification. After the classification process, timing files were created by determining the transition times of the vowels in the word. In the last step, an interface was created on the C# .NET platform for the synchronization process, and a syllabic algorithm was developed in this interface to emphasize the syllable synchronization of the text. Thus, the desired high precision was achieved in the simultaneous highlighting of the words.

Keywords

References

  1. Bayat, H.İ., 2020. Identification of vowel-non vowel letter with artificial neural network and sound-text synchronization at syllable level (Master thesis), Gaziosmanpaşa University, Institute of science and technology, Tokat, Turkey.
  2. Bengio, Y., 2012. Deep learning of representations for unsupervised and transfer learning. In Proceedings of ICML workshop on unsupervised and transfer learning, pp. 17-36.
  3. Cakir, E., 2014. Multilabel sound event classification with neural networks. (Master thesis), Tampere University of Technology, Faculty of Computing and Electrical Engineering, Finland.
  4. Çakır, M.Y., 2017. Real-time high-quality voice recognition. (Master thesis), İstanbul Sabahattin Zaim University, Institute of science and technology, İstanbul, Turkey
  5. Cosi, P., Bengua, Y. and De Maria, R., 1990. Phonetically-based multi-layered neural networks for vowel classification. Speech Communication, 1(9), pp. 15-19.
  6. Dave, N., 2013. Feature extraction methods LPC, PLP and MFCC in speech recognition. Internatıonal journal for advance research ın engıneerıng and technology, 4(1), 5 pp.
  7. Dede, G., 2008. Speech recognition with artificial neural networks (Master thesis), Ankara University, Institute of science and technology, Ankara, Turkey.
  8. Elman, L. J., 1990. Finding structure in time. Cognitive Science, 2(14), pp. 179-211.

Details

Primary Language

English

Subjects

Computer Software, Engineering

Journal Section

Research Article

Publication Date

March 1, 2022

Submission Date

September 17, 2021

Acceptance Date

November 18, 2021

Published in Issue

Year 2022 Volume: 12 Number: 1

APA
Onder, M., & Bayat, H. İ. (2022). Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Journal of the Institute of Science and Technology, 12(1), 41-57. https://doi.org/10.21597/jist.957879
AMA
1.Onder M, Bayat Hİ. Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. J. Inst. Sci. and Tech. 2022;12(1):41-57. doi:10.21597/jist.957879
Chicago
Onder, Mursel, and Halil İbrahim Bayat. 2022. “Classification Vowel-Consonant Letters With Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”. Journal of the Institute of Science and Technology 12 (1): 41-57. https://doi.org/10.21597/jist.957879.
EndNote
Onder M, Bayat Hİ (March 1, 2022) Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. Journal of the Institute of Science and Technology 12 1 41–57.
IEEE
[1]M. Onder and H. İ. Bayat, “Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”, J. Inst. Sci. and Tech., vol. 12, no. 1, pp. 41–57, Mar. 2022, doi: 10.21597/jist.957879.
ISNAD
Onder, Mursel - Bayat, Halil İbrahim. “Classification Vowel-Consonant Letters With Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”. Journal of the Institute of Science and Technology 12/1 (March 1, 2022): 41-57. https://doi.org/10.21597/jist.957879.
JAMA
1.Onder M, Bayat Hİ. Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. J. Inst. Sci. and Tech. 2022;12:41–57.
MLA
Onder, Mursel, and Halil İbrahim Bayat. “Classification Vowel-Consonant Letters With Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size”. Journal of the Institute of Science and Technology, vol. 12, no. 1, Mar. 2022, pp. 41-57, doi:10.21597/jist.957879.
Vancouver
1.Mursel Onder, Halil İbrahim Bayat. Classification Vowel-Consonant Letters with Deep Neural Networks in Turkish and Text-Voice Synchronization on a Basis Syllable Size. J. Inst. Sci. and Tech. 2022 Mar. 1;12(1):41-57. doi:10.21597/jist.957879