EN
Speech-to-Gender Recognition Based on Machine Learning Algorithms
Abstract
Speech recognition has several application areas such as human machine interaction, classification of phone calls by gender, voice tagging, STT, etc. Predicting gender from audio signals is a problem that is easy for humans to solve, difficult to solve by a computer. In this study, a model based on MFCC and classification with machine learning is proposed for gender estimation from Turkish voice signals. Within the scope of the study, 58 different series and films were examined and a new original dataset was created with 894 audio recordings consisting of 5 sec sections taken from them. Mel-frequency cepstral coefficients (MFCC) and spectrogram, which are frequently used in the literature, were used for feature extraction from audio data. The results were first evaluated separately using two features in one way. A hybrid feature vector was then created using two feature vectors. Different machine learning algorithms (LR, DT, RF, XGB etc.) were tested in the classification process and it was seen that the best accuracy was achieved in the hybrid model and logistic regression with 89%. Recall, precision and f-score values were obtained as 86.8%, 92% and 89.3%, respectively. The obtained test results revealed that the proposed model, together with the hybrid feature vector used, the original dataset and the classifier based on machine learning, showed classification success in terms of accuracy and was a stable and robust model.
Keywords
References
- R. S. Arslan and N. Barışçı, “Development of output correction methodology for long short term memory-based speech recognition,” Sustainability, , cilt 11(15), 2019.
- R. S. Arslan and N. Barışçı, “A detailed survey of Turkish automatic speech recognition,” Turkish journal of electrical engineering and computer science, pp. 3253-3269, 2020.
- H. Erokyar, “Age and Gender Recognition for Speech Applications based on Support Vector Machines,” Florida, 2014.
- A. Oğuz, “Ses Sinyallerinden Yaş Grubu ve Cinsiyet Bilgisinin Tahmin Edilmesi,” Siirt, 2018.
- S. Hızlısoy and Z. Tüfekçi, “Noise robust speech recogniton using parallel model compensation and voice activity detection methods,” 2015 5th international conference on electronics, devices, systems, and applications(ICEDSA), pp. 1-4, 2016.
- S. Hızlısoy and R. S. Arslan, “Text independent speaker recognition based on MFCC and machine learning,” Selcuk University Journal of Engineering Sciences, no. 20(3), pp. 73-78, 2021.
- S. Hızlısoy, S. Yıldırım and Z. Tüfekçi, “Music emotional recognition using convolutional long short term memory deep neural networks,” Engineering science and technology, an international journal, no. 24(3), pp. 760-767, 2021.
- A. Tursunov, Mustaqeem, J. Y. Choeh and S. Kwon, “Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Multi-Attention Module through Speech Spectrograms,” Sensors, 09 2021.
Details
Primary Language
English
Subjects
Engineering
Journal Section
Research Article
Publication Date
December 31, 2022
Submission Date
December 19, 2022
Acceptance Date
December 28, 2022
Published in Issue
Year 2022 Volume: 10 Number: 4
APA
Hızlısoy, S., Çolakoğlu, E., & Arslan, R. S. (2022). Speech-to-Gender Recognition Based on Machine Learning Algorithms. International Journal of Applied Mathematics Electronics and Computers, 10(4), 84-92. https://doi.org/10.18100/ijamec.1221455
AMA
1.Hızlısoy S, Çolakoğlu E, Arslan RS. Speech-to-Gender Recognition Based on Machine Learning Algorithms. International Journal of Applied Mathematics Electronics and Computers. 2022;10(4):84-92. doi:10.18100/ijamec.1221455
Chicago
Hızlısoy, Serhat, Emel Çolakoğlu, and Recep Sinan Arslan. 2022. “Speech-to-Gender Recognition Based on Machine Learning Algorithms”. International Journal of Applied Mathematics Electronics and Computers 10 (4): 84-92. https://doi.org/10.18100/ijamec.1221455.
EndNote
Hızlısoy S, Çolakoğlu E, Arslan RS (December 1, 2022) Speech-to-Gender Recognition Based on Machine Learning Algorithms. International Journal of Applied Mathematics Electronics and Computers 10 4 84–92.
IEEE
[1]S. Hızlısoy, E. Çolakoğlu, and R. S. Arslan, “Speech-to-Gender Recognition Based on Machine Learning Algorithms”, International Journal of Applied Mathematics Electronics and Computers, vol. 10, no. 4, pp. 84–92, Dec. 2022, doi: 10.18100/ijamec.1221455.
ISNAD
Hızlısoy, Serhat - Çolakoğlu, Emel - Arslan, Recep Sinan. “Speech-to-Gender Recognition Based on Machine Learning Algorithms”. International Journal of Applied Mathematics Electronics and Computers 10/4 (December 1, 2022): 84-92. https://doi.org/10.18100/ijamec.1221455.
JAMA
1.Hızlısoy S, Çolakoğlu E, Arslan RS. Speech-to-Gender Recognition Based on Machine Learning Algorithms. International Journal of Applied Mathematics Electronics and Computers. 2022;10:84–92.
MLA
Hızlısoy, Serhat, et al. “Speech-to-Gender Recognition Based on Machine Learning Algorithms”. International Journal of Applied Mathematics Electronics and Computers, vol. 10, no. 4, Dec. 2022, pp. 84-92, doi:10.18100/ijamec.1221455.
Vancouver
1.Serhat Hızlısoy, Emel Çolakoğlu, Recep Sinan Arslan. Speech-to-Gender Recognition Based on Machine Learning Algorithms. International Journal of Applied Mathematics Electronics and Computers. 2022 Dec. 1;10(4):84-92. doi:10.18100/ijamec.1221455
Cited By
Gender Recognition Based on the Stacking of Different Acoustic Features
Applied Sciences
https://doi.org/10.3390/app14156564Automatic Age and Gender Recognition Using Ensemble Learning
Applied Sciences
https://doi.org/10.3390/app14166868Makine Öğrenmesi Yöntemleri Kullanılarak Ses Verilerinden Cinsiyet Tahmin Edilmesi
Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi
https://doi.org/10.19113/sdufenbed.1590769A CNN–NCP Based Hybrid Deep Learning Model for Speech-Driven Gender Classification
Bitlis Eren Üniversitesi Fen Bilimleri Dergisi
https://doi.org/10.17798/bitlisfen.1803512