How does language model size effects speech recognition accuracy for the Turkish language?

Behnam Asefisaray; Erhan Mengüşoğlu; Murat Hacıömeroğlu; Hayri Sever

EN TR

How does language model size effects speech recognition accuracy for the Turkish language?

Abstract

In this paper we aimed at investigating the effect of Language Model (LM) size on Speech Recognition (SR) accuracy. We also provided details of our approach for obtaining the LM for Turkish. Since LM is obtained by statistical processing of raw text, we expect that by increasing the size of available data for training the LM, SR accuracy will improve. Since this study is based on recognition of Turkish, which is a highly agglutinative language, it is important to find out the appropriate size for the training data. The minimum required data size is expected to be much higher than the data needed to train a language model for a language with low level of agglutination such as English. In the experiments we also tried to adjust the Language Model Weight (LMW) and Active Token Count (ATC) parameters of LM as these are expected to be different for a highly agglutinative language. We showed that by increasing the training data size to an appropriate level, the recognition accuracy improved on the other hand changes on LMW and ATC did not have a positive effect on Turkish speech recognition accuracy.

Keywords

-

Kaynakça

Aksungurlu T, Parlak S, Sak H, Saraclar M. "Comparison of language modeling approaches for Turkish broadcast news". IEEE 16th Signal Processing, Communication and Applications Conference, Aydın, Turkey, 20-22 April 2008.
Korkmazsky F, Jojic O, Shevade B. "Boosting of speech recognition performance by language model adaptation". IEEE Aerospace Conference, Big Sky, MT, USA, 3-10 March 2007.
Salor Ö, Pellom BL, Ciloglu T, Hacioglu K, Demirekler M. "On developing new text and audio corpora and speech recognition tools for the Turkish language". 7th International Conference on Spoken Language Processing (INTERSPEECH), Denver, CO, USA, 16-20 September 2002.
Suzuki M, Kajiura Y, Ito A, Makino S. "Unsupervised language model adaptation based on automatic text collection from WWW". 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, USA, 17-21 September 2006.
Klakov D. “Language Model adaptation for tiny adaptation corpora”, 9th International Conference on Spoken Language Processing (INTERSPEECH), Pittsburgh, PA, USA, 17-21 September 2006.
Dai J. "Hybrid approach to speech recognition using hidden Markov models and Markov chains". Vision, Image and Signal Processing, 141(5), 273-279, 1994.
Woodland PC, Johnson SE, Jourlin P, Sparck Jones K. "Effects of out of vocabulary words in spoken document retrieval". 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, 24-28 July 2000.
Aksoylar C, Mutluergil SO, Erdogan H. "The anatomy of a turkish speech recognition system". IEEE 17th Signal Processing and Communications Applications Conference, Antalya, Turkey, 09-11 April 2009.

Chen Z, Lee KF, Li M."Discriminative training on language model". 6th International Conference on Spoken Language Processing (INTERSPEECH), Beijing, China, 16-20 October 2000.
Adda-Decker M, Adda G, Gauvain J, Lamel L. "Large vocabulary speech recognition in French". IEEE International Conference on Acoustics, Speech and Signal Processing, Phoenix, AZ, USA, 15-19 March 1999.
Adda G, Adda-Decker M, Gauvain JI, Lamel L. "Text normalization and speech recognition in French". 5th European Conference on Speech Communication and Technology (Eurospeech), Rhodes, Greece, 22-25 September 1997.
Zhuang L, Bao T, Zhu X, Wang C, Naoi S. "A Chinese OCR spelling check approach based on statistical language models". IEEE International Conference on Systems, Man and Cybernetics, Den Haag, the Nederland, 10-13 October 2004.
Can F, Kocberber S, Baglioglu O, Kardas S, Ocalan HC, Uyar E. "New event detection and topic tracking in Turkish". Journal of the American Society for Information Science and Technology, 61(4), 802-819, 2010.
Isotani R, Matsunaga S. "Speech recognition using a stochastic language model integrating local and global constraints". ARPA Spoken Language Technology (SLT) Workshop, Plainsboro, NJ, USA, 1994.
Stolcke A. "SRILM-An Extensible language modeling toolkit". 7th International Conference on Spoken Language Processing (INTERSPEECH), Denver, CO, USA, 16-20 September 2002.
Yazgan A, Saraclar M. "Hybrid language models for out of vocabulary word detection in large vocabulary conversational speech recognition". IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Montreal, Quebec, Canada, 17-21 May 2004.

Ayrıntılar

Birincil Dil

İngilizce

Konular

-

Bölüm

-

Yazarlar

Behnam Asefisaray Bu kişi benim

Erhan Mengüşoğlu Bu kişi benim

Murat Hacıömeroğlu

Hayri Sever

Yayımlanma Tarihi

1 Mayıs 2016

Gönderilme Tarihi

2 Mayıs 2016

Kabul Tarihi

-

Yayımlandığı Sayı

Yıl 2016 Cilt: 22 Sayı: 2

IZ

https://izlik.org/JA98ZY43YT

Kaynak Göster

RIS / Bibtex

APA

Asefisaray, B., Mengüşoğlu, E., Hacıömeroğlu, M., & Sever, H. (2016). How does language model size effects speech recognition accuracy for the Turkish language? Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, 22(2), 100-105. https://izlik.org/JA98ZY43YT

AMA

1.Asefisaray B, Mengüşoğlu E, Hacıömeroğlu M, Sever H. How does language model size effects speech recognition accuracy for the Turkish language? Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi. 2016;22(2):100-105. https://izlik.org/JA98ZY43YT

Chicago

Asefisaray, Behnam, Erhan Mengüşoğlu, Murat Hacıömeroğlu, ve Hayri Sever. 2016. “How does language model size effects speech recognition accuracy for the Turkish language?”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 22 (2): 100-105. https://izlik.org/JA98ZY43YT.

EndNote

Asefisaray B, Mengüşoğlu E, Hacıömeroğlu M, Sever H (01 Mayıs 2016) How does language model size effects speech recognition accuracy for the Turkish language? Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 22 2 100–105.

IEEE

[1]B. Asefisaray, E. Mengüşoğlu, M. Hacıömeroğlu, ve H. Sever, “How does language model size effects speech recognition accuracy for the Turkish language?”, Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, c. 22, sy 2, ss. 100–105, May. 2016, [çevrimiçi]. Erişim adresi: https://izlik.org/JA98ZY43YT

ISNAD

Asefisaray, Behnam - Mengüşoğlu, Erhan - Hacıömeroğlu, Murat - Sever, Hayri. “How does language model size effects speech recognition accuracy for the Turkish language?”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 22/2 (01 Mayıs 2016): 100-105. https://izlik.org/JA98ZY43YT.

JAMA

1.Asefisaray B, Mengüşoğlu E, Hacıömeroğlu M, Sever H. How does language model size effects speech recognition accuracy for the Turkish language? Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi. 2016;22:100–105.

MLA

Asefisaray, Behnam, vd. “How does language model size effects speech recognition accuracy for the Turkish language?”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, c. 22, sy 2, Mayıs 2016, ss. 100-5, https://izlik.org/JA98ZY43YT.

Vancouver

1.Behnam Asefisaray, Erhan Mengüşoğlu, Murat Hacıömeroğlu, Hayri Sever. How does language model size effects speech recognition accuracy for the Turkish language? Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi [Internet]. 01 Mayıs 2016;22(2):100-5. Erişim adresi: https://izlik.org/JA98ZY43YT

How does language model size effects speech recognition accuracy for the Turkish language?

How does language model size effects speech recognition accuracy for the Turkish language?

Abstract

Keywords

Türkçe ses tanıma sistemlerinde dil modeli boyutunun doğruluk oranına etkisi

Öz

Anahtar Kelimeler

Kaynakça

Ayrıntılar

Birincil Dil

Konular

Bölüm

Yazarlar

Yayımlanma Tarihi

Gönderilme Tarihi

Kabul Tarihi

Yayımlandığı Sayı

IZ

Kaynak Göster