Research Article

Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms

Volume: 33 December 30, 2021
EN TR

Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms

Abstract

Speech and speaker recognition systems aim to analyze parametric information contained in the human voice and recognize it at the highest possible rate. One of the most important features in the audio signal for the speaker to be recognized successfully by the system is the speaker's accent. Speaker accent recognition systems are based on the analysis of patterns such as the way the speaker speaks and the word choice he uses while speaking. In this study, the data obtained by the MFCC feature extraction technique from voice signals of 367 speakers with 7 different accents were used. The data of 330 speakers in the data set were taken from the "Speaker Accent Recognition" data set in the UC Irvine Machine Learning (ML) open data source. The data of the other 37 speakers were obtained by converting the voice recordings in the "Speaker Accent Archive" data set created by George Mason University into data using the MFCC feature extraction technique. 9 ML classification algorithms were used for the designed speaker accent recognition system. Also, the k-fold cross-validation technique was used to test the data set independently. In this way, the performance of ML algorithms is shown when the data set is divided into a k number of parts. Information about the classification algorithms used in the designed system and the hyperparameter optimizations made in these algorithms are also given. The success performances of the classification algorithms are shown with performance metrics.

Keywords

Thanks

Bu çalışma ASYU2020_Akıllı Sistemlerde Yenilikler ve Uygulamaları Özel sayısı için değerlendirilmek üzere gönderilmiştir.

References

  1. Van Leeuwen D. A., Martin A. F., Przybocki M. A., and Bouten J. S., “NIST and TNO-NFI evaluations of automatic speaker recognition,” Comput. Speech Lang., vol. 20, pp. 128–158, 2006.
  2. Furui, S. “50 Years of Progress in Speech and Speaker Recognition Research.” (1970).
  3. Kinnunen T. and Li H., “An overview of text-independent speaker recognition: From features to supervectors,” Speech communication, vol. 52, no. 1, pp. 12–40, 2010.
  4. Nakagawa S., Wang L. and Ohtsuka S., "Speaker identification and verification by combining MFCC and phase information", IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 4, pp. 1085-1095, May 2012.
  5. Faria A., "Accent classification for speech recognition", proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI '05), 2005.
  6. Turner C. and Joseph A., “A wavelet packet and mel-frequency cepstral coefficients-based feature extraction method for speaker identification”, Procedia Computer Science, 61, pp. 416-421, 2015.
  7. De-la-Calle-Silos F. and Stern R. M., "Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition," in IEEE Signal Processing Letters, vol. 24, no. 8, pp. 1158-1162, Aug. 2017.
  8. Ranjan R. and Thakur A., "Analysis of feature extraction techniques for speech recognition system", International Journal of Innovative Technology and Exploring Engineering, vol. 8, no. 7C2, pp. 197-200, 2019.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

December 30, 2021

Submission Date

March 14, 2021

Acceptance Date

December 8, 2021

Published in Issue

Year 2021 Volume: 33

APA
Ayrancı, A. A., Atay, S., & Yıldırım, T. (2021). Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. International Journal of Advances in Engineering and Pure Sciences, 33, 17-27. https://doi.org/10.7240/jeps.896427
AMA
1.Ayrancı AA, Atay S, Yıldırım T. Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. JEPS. 2021;33:17-27. doi:10.7240/jeps.896427
Chicago
Ayrancı, Ahmet Aytuğ, Sergen Atay, and Tülay Yıldırım. 2021. “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”. International Journal of Advances in Engineering and Pure Sciences 33 (December): 17-27. https://doi.org/10.7240/jeps.896427.
EndNote
Ayrancı AA, Atay S, Yıldırım T (December 1, 2021) Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. International Journal of Advances in Engineering and Pure Sciences 33 17–27.
IEEE
[1]A. A. Ayrancı, S. Atay, and T. Yıldırım, “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”, JEPS, vol. 33, pp. 17–27, Dec. 2021, doi: 10.7240/jeps.896427.
ISNAD
Ayrancı, Ahmet Aytuğ - Atay, Sergen - Yıldırım, Tülay. “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”. International Journal of Advances in Engineering and Pure Sciences 33 (December 1, 2021): 17-27. https://doi.org/10.7240/jeps.896427.
JAMA
1.Ayrancı AA, Atay S, Yıldırım T. Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. JEPS. 2021;33:17–27.
MLA
Ayrancı, Ahmet Aytuğ, et al. “Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms”. International Journal of Advances in Engineering and Pure Sciences, vol. 33, Dec. 2021, pp. 17-27, doi:10.7240/jeps.896427.
Vancouver
1.Ahmet Aytuğ Ayrancı, Sergen Atay, Tülay Yıldırım. Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. JEPS. 2021 Dec. 1;33:17-2. doi:10.7240/jeps.896427

Cited By