Research Article

Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)

Volume: 2 Number: 2 December 30, 2019
TR EN

Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)

Abstract

In this paper, a new Turkish spoken number recognition system proposed. The Mel-frequency cepstral coefficients (MFCC) algorithm used as a feature extraction method, the Gaussian Hidden Markov model, used for numbers phonemes modeling where each number has a Markov model. The system trained on a dataset collected from 20 subjects that includes 7 females and 13 males. Each one says the Turkish numbers from “zero” to “ten”. Audio files sampled at 8000Hz at each second and each file has one-second length and recorded in an isolated environment. We tested the system using random records for different people. The training files include 220 audio record and testing files include 18 audio record. The system achieves %83.3 accuracy, %86 precision, and %83 recall rates.

Keywords

References

  1. Rabiner, Lawrence R., and Biing-Hwang Juang. Fundamentals of speech recognition. Vol. 14. Englewood Cliffs: PTR Prentice Hall, 1993.
  2. Deller, John R., John HL Hansen, and John G. Proakis. "Discrete-time processing of speech signals." (2000): 595-602
  3. Motlıcek, Petr. Feature extraction in speech coding and recognition. Technical Report of PhD research internship in ASP Group, OGI-OHSU, http://www. fit. vutbr. cz/∼ motlicek/publi/2002/rep ogi. pdf, 2002
  4. Hagen, Andreas, Daniel A. Connors, and Bryan L. Pellom. "The analysis and design of architecture systems for speech recognition on modern handheld-computing devices." Proceedings of the 1st IEEE/ACM/IFIP international conference on Hardware/ Software codesign and system synthesis. ACM, 2003
  5. Ishizuka, Kentaro, and Tomohiro Nakatani. "A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition." Speech communication 48.11 (2006): 1447-1457
  6. Xu, Min, et al. "HMM-based audio keyword generation." Pacific-Rim Conference on Multimedia. Springer, Berlin, Heidelberg, 2004
  7. G. Evermann, H. Y. Chan, M. J. F. Gales, T. Hain, X. Liu, D. Mrva, L. Wang,and P. Woodland, “Development of the 2003 CU-HTK conversational telephone speech transcription system,” in Proceedings of ICASSP, Montreal, Canada, 2004
  8. S. Matsoukas, J.-L. Gauvain, A. Adda, T. Colthurst, C. I. Kao, O. Kimball, L. Lamel, F. Lefevre, J. Z. Ma, J. Makhoul, L. Nguyen, R. Prasad, R. Schwartz, H. Schwenk, and B. Xiang, “Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system,” IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1541–1556, September 2006

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

December 30, 2019

Submission Date

November 15, 2019

Acceptance Date

December 29, 2019

Published in Issue

Year 2019 Volume: 2 Number: 2

APA
Kocer, H. E., & Ahmed, M. C. (2019). Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilimi, 2(2), 39-44. https://izlik.org/JA24TD55YY
AMA
1.Kocer HE, Ahmed MC. Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Data Sci. J. 2019;2(2):39-44. https://izlik.org/JA24TD55YY
Chicago
Kocer, Hasan Erdinc, and Mustafa Cumaah Ahmed. 2019. “Turkish Speech Recognition Using Mel-Frequency Cepstral Coefficients(MFCC) and Hidden Markov Model (HMM)”. Veri Bilimi 2 (2): 39-44. https://izlik.org/JA24TD55YY.
EndNote
Kocer HE, Ahmed MC (December 1, 2019) Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilimi 2 2 39–44.
IEEE
[1]H. E. Kocer and M. C. Ahmed, “Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)”, Data Sci. J., vol. 2, no. 2, pp. 39–44, Dec. 2019, [Online]. Available: https://izlik.org/JA24TD55YY
ISNAD
Kocer, Hasan Erdinc - Ahmed, Mustafa Cumaah. “Turkish Speech Recognition Using Mel-Frequency Cepstral Coefficients(MFCC) and Hidden Markov Model (HMM)”. Veri Bilimi 2/2 (December 1, 2019): 39-44. https://izlik.org/JA24TD55YY.
JAMA
1.Kocer HE, Ahmed MC. Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Data Sci. J. 2019;2:39–44.
MLA
Kocer, Hasan Erdinc, and Mustafa Cumaah Ahmed. “Turkish Speech Recognition Using Mel-Frequency Cepstral Coefficients(MFCC) and Hidden Markov Model (HMM)”. Veri Bilimi, vol. 2, no. 2, Dec. 2019, pp. 39-44, https://izlik.org/JA24TD55YY.
Vancouver
1.Hasan Erdinc Kocer, Mustafa Cumaah Ahmed. Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Data Sci. J. [Internet]. 2019 Dec. 1;2(2):39-44. Available from: https://izlik.org/JA24TD55YY