TWO-STAGE DECISION MAKING ALGORITHM FOR SPEAKER VERIFICATION WITH TRAINING SET OPTIMIZATION
Abstract
In this paper, a two-stage decision making algorithm is proposed for the task of speaker verification. This two-stage algorithm aims to eliminate the first-stage qualifying impostors by the help of impostor-resistant structure in the second stage. First, a baseline system is formed using mel-frequency cepstral coefficients (MFCC) as features and, a radial basis function (RBF) neural network for speaker modelling. Then, the investigations have been realized for optimizing the training set by means of two issues: (1) the ratio of impostor features to genuine speaker features, (2) the ratio of same gender features to opposite gender features (in respect of the genuine speaker) within the impostor speakers’ set. Last, the two-stage decision making algorithm is presented, and the performance enhancement provided by the two-stage system is given with the test results.
Keywords
References
- [1] WIQAS G., NAVDEEP S., “Literature Review on Automatic Speech Recognition”. International Journal of Computer Applications, 41, 42-50, 2012.
- [2] SHIKHA G., AMIT P., ACHAL S., “A Study on Speech Recognition System: A Literature Review”, International Journal of Science, Engineering and Technology Research (IJSETR), 3, 2192-2196, 2014.
- [3] LIU Y, QIAN Y., CHAN N., FU T., ZHANG Y., YU K., “Deep Feature for Text-dependent Speaker verification”, Speech Communication, 73, 1–13, 2015.
- [4] BHATTACHARYYA S, SRIKANTHAN T, KRISHNAMURTHY P, “Ideal GMM parameters and posterior log likelihood in speaker verification”, Proc. IEEE Signal Processing Soc. Neural Networks for Signal Processing XI, 471-480, 2001.
- [5] XU Y., SHEN F., ZHAO J., “An incremental learning vector quantization algorithm for pattern classification”. Neural Computing and Applications, 21, 1205–1215, 2012.
- [6] GALES M., YOUNG S., “The Application of Hidden Markov Models in Speech Recognition”. Foundations and Trends in Signal Processing, 1, 195–304, 2007.
- [7] PATEL I., SRINIVAS Y. R., “A Frequency Spectral Feature Modeling for Hidden Markov Model Based Automated Speech Recognition” Recent Trends in Networks and Communications, Communications in Computer and Information Science, 90, 134-143. Springer, Berlin, Heidelberg, 2010.
- [8] KAMRUZZAMAN S. M., A. N. M. REZAUL KARIM A. N. M., ISLAM S., HAQUE E., “Speaker Identification using MFCC-Domain Support Vector Machine”, International Journal of Electrical and Power Engineering, 1, 274-278, 2007.
Details
Primary Language
English
Subjects
Electrical Engineering
Journal Section
Research Article
Authors
Efe Tankut Yaparoğlu
This is me
0000-0003-1537-1237
Türkiye
Yavuz Şenol
*
0000-0002-3686-5597
Türkiye
Publication Date
January 28, 2019
Submission Date
February 8, 2018
Acceptance Date
September 26, 2018
Published in Issue
Year 2019 Volume: 8 Number: 1