Lip Reading Using Various Deep Learning Models with Visual Turkish Data
Details
Primary Language: English
Subjects: Engineering
Journal Section: Research Article
Authors:
- Ali Berkol* (ORCID: 0000-0002-3056-1226), Türkiye
- Talya Tümer Sivri (ORCID: 0000-0003-1813-5539), Türkiye
- Hamit Erdem (ORCID: 0000-0003-1704-1581), Türkiye
Early Pub Date: January 15, 2024
Publication Date: September 1, 2024
Submission Date: January 19, 2023
Acceptance Date: November 15, 2023
Published in Issue: Year 2024, Volume 37, Number 3
Cited By
Some New Techniques of Computing Correlation Coefficient between q-Rung Orthopair Fuzzy Sets and their Applications in Multi-Criteria Decision-Making
Gazi University Journal of Science
https://doi.org/10.35378/gujs.1420424
Challenges and enhancements in Turkish automatic lip reading using deep learning models
Signal, Image and Video Processing
https://doi.org/10.1007/s11760-026-05252-2