EN
TR
Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms
Öz
Speech and speaker recognition systems aim to analyze parametric information contained in the human voice and recognize it at the highest possible rate. One of the most important features in the audio signal for the speaker to be recognized successfully by the system is the speaker's accent. Speaker accent recognition systems are based on the analysis of patterns such as the way the speaker speaks and the word choice he uses while speaking. In this study, the data obtained by the MFCC feature extraction technique from voice signals of 367 speakers with 7 different accents were used. The data of 330 speakers in the data set were taken from the "Speaker Accent Recognition" data set in the UC Irvine Machine Learning (ML) open data source. The data of the other 37 speakers were obtained by converting the voice recordings in the "Speaker Accent Archive" data set created by George Mason University into data using the MFCC feature extraction technique. 9 ML classification algorithms were used for the designed speaker accent recognition system. Also, the k-fold cross-validation technique was used to test the data set independently. In this way, the performance of ML algorithms is shown when the data set is divided into a k number of parts. Information about the classification algorithms used in the designed system and the hyperparameter optimizations made in these algorithms are also given. The success performances of the classification algorithms are shown with performance metrics.
Anahtar Kelimeler
Teşekkür
Bu çalışma ASYU2020_Akıllı Sistemlerde Yenilikler ve Uygulamaları Özel sayısı için değerlendirilmek üzere gönderilmiştir.
Kaynakça
- Van Leeuwen D. A., Martin A. F., Przybocki M. A., and Bouten J. S., “NIST and TNO-NFI evaluations of automatic speaker recognition,” Comput. Speech Lang., vol. 20, pp. 128–158, 2006.
- Furui, S. “50 Years of Progress in Speech and Speaker Recognition Research.” (1970).
- Kinnunen T. and Li H., “An overview of text-independent speaker recognition: From features to supervectors,” Speech communication, vol. 52, no. 1, pp. 12–40, 2010.
- Nakagawa S., Wang L. and Ohtsuka S., "Speaker identification and verification by combining MFCC and phase information", IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 4, pp. 1085-1095, May 2012.
- Faria A., "Accent classification for speech recognition", proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI '05), 2005.
- Turner C. and Joseph A., “A wavelet packet and mel-frequency cepstral coefficients-based feature extraction method for speaker identification”, Procedia Computer Science, 61, pp. 416-421, 2015.
- De-la-Calle-Silos F. and Stern R. M., "Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition," in IEEE Signal Processing Letters, vol. 24, no. 8, pp. 1158-1162, Aug. 2017.
- Ranjan R. and Thakur A., "Analysis of feature extraction techniques for speech recognition system", International Journal of Innovative Technology and Exploring Engineering, vol. 8, no. 7C2, pp. 197-200, 2019.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
30 Aralık 2021
Gönderilme Tarihi
14 Mart 2021
Kabul Tarihi
8 Aralık 2021
Yayımlandığı Sayı
Yıl 2021 Cilt: 33
APA
Ayrancı, A. A., Atay, S., & Yıldırım, T. (2021). Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. International Journal of Advances in Engineering and Pure Sciences, 33, 17-27. https://doi.org/10.7240/jeps.896427
AMA
1.Ayrancı AA, Atay S, Yıldırım T. Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. JEPS. 2021;33:17-27. doi:10.7240/jeps.896427
Chicago
Ayrancı, Ahmet Aytuğ, Sergen Atay, ve Tülay Yıldırım. 2021. “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”. International Journal of Advances in Engineering and Pure Sciences 33 (Aralık): 17-27. https://doi.org/10.7240/jeps.896427.
EndNote
Ayrancı AA, Atay S, Yıldırım T (01 Aralık 2021) Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. International Journal of Advances in Engineering and Pure Sciences 33 17–27.
IEEE
[1]A. A. Ayrancı, S. Atay, ve T. Yıldırım, “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”, JEPS, c. 33, ss. 17–27, Ara. 2021, doi: 10.7240/jeps.896427.
ISNAD
Ayrancı, Ahmet Aytuğ - Atay, Sergen - Yıldırım, Tülay. “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”. International Journal of Advances in Engineering and Pure Sciences 33 (01 Aralık 2021): 17-27. https://doi.org/10.7240/jeps.896427.
JAMA
1.Ayrancı AA, Atay S, Yıldırım T. Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. JEPS. 2021;33:17–27.
MLA
Ayrancı, Ahmet Aytuğ, vd. “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”. International Journal of Advances in Engineering and Pure Sciences, c. 33, Aralık 2021, ss. 17-27, doi:10.7240/jeps.896427.
Vancouver
1.Ahmet Aytuğ Ayrancı, Sergen Atay, Tülay Yıldırım. Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. JEPS. 01 Aralık 2021;33:17-2. doi:10.7240/jeps.896427
Cited By
A systematic review of accent classification techniques and datasets for inclusive speech recognition
International Journal of Data Science and Analytics
https://doi.org/10.1007/s41060-025-00954-1