Inception Model for Automatic Arabic Speech Recognition

Zoubir Talaı; Nada Kherıcı

doi:10.55549/epstem.1409606

EN

Inception Model for Automatic Arabic Speech Recognition

Abstract

Reproducing basic human abilities has always been the main purpose for Artificial Intelligence (AI) systems. Since speech is essential to people’s communication, AI was applied to this major field to achieve Automatic Speech Recognition (ASR). In this paper, we focus on the inception model as a solution for Arabic speech recognition, due to its remarkable results on image classification tasks. We adapted this model for ASR problems and tried it on a dataset of spoken Arabic digits collected from social media apps and published corpora which resulted in more than 54000 utterances. A comparison between the proposed model and a traditional Convolutional Neural Network (CNN) shows the superiority of the inception model in ASR tasks. The inception model achieved 99.70% accuracy on the training dataset which is far better than the traditional CNN that achieved 87.46% on the same set, it did also great performance on the test subset with 88.96% accuracy compared to the traditional model with 84.78% recognition rate.

Keywords

References

an, W., Zhang, Z., Zhang, Y., Yu, J., Chiu, C.-C., Qin, J., Gulati, A., Pang, R., & Wu, Y. (2020). ContextNet: improving ocnvolutional neural networks for automatic speech recognition with global context. arXiv.
Hourri, S., Nikolov, N. S., & Kharroubi, J. (2021). Convolutional neural network vectors for speaker recognition. International Journal of Speech Technology, 24(2), 389–400.
Kiranyaz, S., Avci, O., Abdeljaber, O., Ince, T., Gabbouj, M., & Inman, D. J. (2021). 1D convolutional neural networks and applications: A survey. Mechanical Systems and Signal Processing, 151, 107398.

Details

Primary Language

English

Subjects

Automated Software Engineering

Journal Section

Conference Paper

Authors

Zoubir Talaı This is me
Algeria

Nada Kherıcı This is me
Algeria

Early Pub Date

December 25, 2023

Publication Date

December 30, 2023

Submission Date

July 11, 2023

Acceptance Date

November 27, 2023

Published in Issue

Year 2023 Volume: 26

DOI

https://doi.org/10.55549/epstem.1409606

IZ

https://izlik.org/JA96YL99XM

Cite

RIS / Bibtex

APA

Talaı, Z., & Kherıcı, N. (2023). Inception Model for Automatic Arabic Speech Recognition. The Eurasia Proceedings of Science Technology Engineering and Mathematics, 26, 327-331. https://doi.org/10.55549/epstem.1409606

AMA

1.Talaı Z, Kherıcı N. Inception Model for Automatic Arabic Speech Recognition. EPSTEM. 2023;26:327-331. doi:10.55549/epstem.1409606

Chicago

Talaı, Zoubir, and Nada Kherıcı. 2023. “Inception Model for Automatic Arabic Speech Recognition”. The Eurasia Proceedings of Science Technology Engineering and Mathematics 26 (December): 327-31. https://doi.org/10.55549/epstem.1409606.

EndNote

Talaı Z, Kherıcı N (December 1, 2023) Inception Model for Automatic Arabic Speech Recognition. The Eurasia Proceedings of Science Technology Engineering and Mathematics 26 327–331.

IEEE

[1]Z. Talaı and N. Kherıcı, “Inception Model for Automatic Arabic Speech Recognition”, EPSTEM, vol. 26, pp. 327–331, Dec. 2023, doi: 10.55549/epstem.1409606.

ISNAD

Talaı, Zoubir - Kherıcı, Nada. “Inception Model for Automatic Arabic Speech Recognition”. The Eurasia Proceedings of Science Technology Engineering and Mathematics 26 (December 1, 2023): 327-331. https://doi.org/10.55549/epstem.1409606.

JAMA

1.Talaı Z, Kherıcı N. Inception Model for Automatic Arabic Speech Recognition. EPSTEM. 2023;26:327–331.

MLA

Talaı, Zoubir, and Nada Kherıcı. “Inception Model for Automatic Arabic Speech Recognition”. The Eurasia Proceedings of Science Technology Engineering and Mathematics, vol. 26, Dec. 2023, pp. 327-31, doi:10.55549/epstem.1409606.

Vancouver

1.Zoubir Talaı, Nada Kherıcı. Inception Model for Automatic Arabic Speech Recognition. EPSTEM. 2023 Dec. 1;26:327-31. doi:10.55549/epstem.1409606