Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)

Hasan Erdinc Kocer; Mustafa Cumaah Ahmed

TR EN

Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)

Öz

In this paper, a new Turkish spoken number recognition system proposed. The Mel-frequency cepstral coefficients (MFCC) algorithm used as a feature extraction method, the Gaussian Hidden Markov model, used for numbers phonemes modeling where each number has a Markov model. The system trained on a dataset collected from 20 subjects that includes 7 females and 13 males. Each one says the Turkish numbers from “zero” to “ten”. Audio files sampled at 8000Hz at each second and each file has one-second length and recorded in an isolated environment. We tested the system using random records for different people. The training files include 220 audio record and testing files include 18 audio record. The system achieves %83.3 accuracy, %86 precision, and %83 recall rates.

Anahtar Kelimeler

Hidden Markov Model,Mel-Frequency Cepstral Coefficients,Turkish Speech Recognition

Kaynakça

Rabiner, Lawrence R., and Biing-Hwang Juang. Fundamentals of speech recognition. Vol. 14. Englewood Cliffs: PTR Prentice Hall, 1993.
Deller, John R., John HL Hansen, and John G. Proakis. "Discrete-time processing of speech signals." (2000): 595-602
Motlıcek, Petr. Feature extraction in speech coding and recognition. Technical Report of PhD research internship in ASP Group, OGI-OHSU, http://www. fit. vutbr. cz/∼ motlicek/publi/2002/rep ogi. pdf, 2002
Hagen, Andreas, Daniel A. Connors, and Bryan L. Pellom. "The analysis and design of architecture systems for speech recognition on modern handheld-computing devices." Proceedings of the 1st IEEE/ACM/IFIP international conference on Hardware/ Software codesign and system synthesis. ACM, 2003
Ishizuka, Kentaro, and Tomohiro Nakatani. "A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition." Speech communication 48.11 (2006): 1447-1457
Xu, Min, et al. "HMM-based audio keyword generation." Pacific-Rim Conference on Multimedia. Springer, Berlin, Heidelberg, 2004
G. Evermann, H. Y. Chan, M. J. F. Gales, T. Hain, X. Liu, D. Mrva, L. Wang,and P. Woodland, “Development of the 2003 CU-HTK conversational telephone speech transcription system,” in Proceedings of ICASSP, Montreal, Canada, 2004
S. Matsoukas, J.-L. Gauvain, A. Adda, T. Colthurst, C. I. Kao, O. Kimball, L. Lamel, F. Lefevre, J. Z. Ma, J. Makhoul, L. Nguyen, R. Prasad, R. Schwartz, H. Schwenk, and B. Xiang, “Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system,” IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1541–1556, September 2006

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yazarlar

Hasan Erdinc Kocer ^*
0000-0002-0799-2140
Türkiye

Mustafa Cumaah Ahmed Bu kişi benim
0000-0002-6014-6007
Türkiye

Yayımlanma Tarihi

30 Aralık 2019

Gönderilme Tarihi

15 Kasım 2019

Kabul Tarihi

29 Aralık 2019

Yayımlandığı Sayı

Yıl 2019 Cilt: 2 Sayı: 2

IZ

https://izlik.org/JA24TD55YY

Kaynak Göster

RIS / Bibtex

APA

Kocer, H. E., & Ahmed, M. C. (2019). Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilimi, 2(2), 39-44. https://izlik.org/JA24TD55YY

AMA

1.Kocer HE, Ahmed MC. Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilim Derg. 2019;2(2):39-44. https://izlik.org/JA24TD55YY

Chicago

Kocer, Hasan Erdinc, ve Mustafa Cumaah Ahmed. 2019. “Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)”. Veri Bilimi 2 (2): 39-44. https://izlik.org/JA24TD55YY.

EndNote

Kocer HE, Ahmed MC (01 Aralık 2019) Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilimi 2 2 39–44.

IEEE

[1]H. E. Kocer ve M. C. Ahmed, “Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)”, Veri Bilim Derg, c. 2, sy 2, ss. 39–44, Ara. 2019, [çevrimiçi]. Erişim adresi: https://izlik.org/JA24TD55YY

ISNAD

Kocer, Hasan Erdinc - Ahmed, Mustafa Cumaah. “Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)”. Veri Bilimi 2/2 (01 Aralık 2019): 39-44. https://izlik.org/JA24TD55YY.

JAMA

1.Kocer HE, Ahmed MC. Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilim Derg. 2019;2:39–44.

MLA

Kocer, Hasan Erdinc, ve Mustafa Cumaah Ahmed. “Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)”. Veri Bilimi, c. 2, sy 2, Aralık 2019, ss. 39-44, https://izlik.org/JA24TD55YY.

Vancouver

1.Hasan Erdinc Kocer, Mustafa Cumaah Ahmed. Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM). Veri Bilim Derg [Internet]. 01 Aralık 2019;2(2):39-44. Erişim adresi: https://izlik.org/JA24TD55YY

Mel-Frekans Kepstral Katsayılar ve Gizli Markov Model Kullanılarak Türkçe Konuşma Tanıma

Öz

Anahtar Kelimeler

Turkish Speech recognition using Mel-frequency cepstral coefficients(MFCC) and Hidden Markov Model (HMM)

Öz

Anahtar Kelimeler

Kaynakça

Ayrıntılar

Birincil Dil

Konular

Bölüm

Yazarlar

Yayımlanma Tarihi

Gönderilme Tarihi

Kabul Tarihi

Yayımlandığı Sayı

IZ

Kaynak Göster