Review

A Convolutional Neural Network Model Implementation for Speech Recognition

Volume: 7 Number: 3 July 31, 2019
EN TR

A Convolutional Neural Network Model Implementation for Speech Recognition

Abstract

Speech recognition is the capability of an appliance to analyze vocable and diction in a phonetic language and turn them into a machine comprehensible arrangement. It is an interdisciplinary subfield of linguistics, computer science and electrical engineering that establishes processes and techniques that understands and converts speech to text. This paper presents a convolutional neural network model for recognition of speech data.

Keywords

References

  1. [1] K. Davis , R. Biddulph, and S. Balashek “Automatic Recognition of Spoken Digits”, The Journal of the Acoustical Society of America, vol. 24, no. 6 , pp. 637-642, 1952.
  2. [2] S. Das, M. A. Picheny, In Automatic Speech and Speaker Recognition, Boston, USA: Springer, 1996, pp. 457-479
  3. [3] S. Hochreiter, J. Schmidhuber, “Long short-term memory”, Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997
  4. [4] M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean and M. Kudlur “Tensorflow: A System for large-scale machine learning”, 12th Symposium on Operating Systems Design and Implementation (OSDI), Savannah, GA, USA, 2016, pp. 265-283 [5] Tensowflow Speech Commands Data Set v0.01 (2019, 01 April). [Online]. Erişim: https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/data
  5. [6] H. Nyquist, “Certain topics in telegraph transmission theory”, Transactions of the American Institute of Electrical Engineers, vol. 47, no. 2, pp. 617-644, 1928
  6. [7] Davis, Steven, and P. Mermelstein, “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences”, IEEE transactions on acoustics, speech, and signal processing, vol. 28, no. 4, pp. 357-366, 1980
  7. [8] Slaney, Malcolm, Michele Covell, and B. Lassiter, “Automatic audio morphing”, International Conference on Acoustics, Speech, and Signal Processing Conference (IEEE), 1996, pp. 1001-1004
  8. [9] S. Postalcioglu, “Performance Analysis of Different Optimizers for Deep Learning-Based Image Recognition”, International Journal of Pattern Recognition and Artificial Intelligence, 2019

Details

Primary Language

English

Subjects

Engineering

Journal Section

Review

Publication Date

July 31, 2019

Submission Date

May 20, 2019

Acceptance Date

July 6, 2019

Published in Issue

Year 2019 Volume: 7 Number: 3

APA
Kayıkçı, Ş. (2019). A Convolutional Neural Network Model Implementation for Speech Recognition. Duzce University Journal of Science and Technology, 7(3), 1892-1898. https://doi.org/10.29130/dubited.567828
AMA
1.Kayıkçı Ş. A Convolutional Neural Network Model Implementation for Speech Recognition. DUBİTED. 2019;7(3):1892-1898. doi:10.29130/dubited.567828
Chicago
Kayıkçı, Şafak. 2019. “A Convolutional Neural Network Model Implementation for Speech Recognition”. Duzce University Journal of Science and Technology 7 (3): 1892-98. https://doi.org/10.29130/dubited.567828.
EndNote
Kayıkçı Ş (July 1, 2019) A Convolutional Neural Network Model Implementation for Speech Recognition. Duzce University Journal of Science and Technology 7 3 1892–1898.
IEEE
[1]Ş. Kayıkçı, “A Convolutional Neural Network Model Implementation for Speech Recognition”, DUBİTED, vol. 7, no. 3, pp. 1892–1898, July 2019, doi: 10.29130/dubited.567828.
ISNAD
Kayıkçı, Şafak. “A Convolutional Neural Network Model Implementation for Speech Recognition”. Duzce University Journal of Science and Technology 7/3 (July 1, 2019): 1892-1898. https://doi.org/10.29130/dubited.567828.
JAMA
1.Kayıkçı Ş. A Convolutional Neural Network Model Implementation for Speech Recognition. DUBİTED. 2019;7:1892–1898.
MLA
Kayıkçı, Şafak. “A Convolutional Neural Network Model Implementation for Speech Recognition”. Duzce University Journal of Science and Technology, vol. 7, no. 3, July 2019, pp. 1892-8, doi:10.29130/dubited.567828.
Vancouver
1.Şafak Kayıkçı. A Convolutional Neural Network Model Implementation for Speech Recognition. DUBİTED. 2019 Jul. 1;7(3):1892-8. doi:10.29130/dubited.567828

Cited By