Effects Of Background Data Duration On Speaker Verification Performance

Volume: 18 Number: 1 April 1, 2013
  • Cemal Hanilçi
  • Figen Ertaş
TR EN

Effects Of Background Data Duration On Speaker Verification Performance

Abstract

Gauss karışım modeli genel arka plan modeli (GKM-GAM) ve vektör nicemleme genel arka plan modeli (VN-GAM) konuşmacı doğrulamada sık kullanılan iki yöntemdir. Genellikle GAM modeli fazla sayıda farklı konuşmacının bulunduğu bir kümeden seçilen saatlerce uzunluktaki ses işaretleri kullanılarak eğitilir. Bu çalışmada, GAM modelinin eğitiminde kullanılan veri miktarının metinden bağımsız konuşmacı doğrulama performansına etkisi incelenmektedir. NIST 2002 konuşmacı tanıma değerlendirme veritabanı ile GKM-GAM ve VN-GAM yöntemleri kullanılarak yapılan deneysel çalışmalar arka plan modelini eğitmek için kullanılan veri miktarının konuşmacı tanıma performansına çok fazla etkisinin olmadığı görülmüştür

Keywords

References

  1. Campbell, W., Sturim, D. E., Reynolds, D. A., Support Vector Machines Using GMM Supervectors for Speaker Verification, IEEE Signal Processing Letters, Vol. 13, No. 5, pp. 308–311, May 2006.
  2. Dehak, N., Kenny, P., Dehak, R., Dumouchel, P and Ouellet, P. (2011) Front-End Factor Analysis for Speaker Verification, IEEE Transactions on Audio, Speech and Language Processing, 19(4), 788-798.
  3. Hanilçi, C. and Ertaş, F. (2011) Comparison of the impact of some Minkowski metrics on VQ/GMM based speaker recognition, Computers & Electrical Engineering, 37(1), 41-56.
  4. Hautamäki, V., Kinnunen, T., Kärkkäinen, I., Tuononen, M., Saastamoinen, J. and Fränti, P. (2008) Maximum a Posteriori Estimation of the Centroid Model for Speaker Verification, IEEE Signal Processing Letters, 15: 162--165.
  5. Kenny, P., Boulianne, G., Ouellet, P. and Dumouchel, P. (2007) Joint factor analysis versus eigenchannels in speaker recognition, IEEE Transactions on Audio, Speech and Language Processing, 15 (4), 1435-1447.
  6. Kinnunen, T., Saastamoinen, J., Hautamäki, V., Vinni, M. and Fränti, P. (2009) Comparative Evaluation of Maximum a Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification, Pattern Recognition Letters, 30(4): 341--347.
  7. Kinnunen, T. and Li, H. (2011) An Overview of Text-Independent Speaker Recognition: from Features to Supervectors, Speech Communication 52(1), 12--40.
  8. NIST, (2001). http://www.itl.nist.gov/iad/mig/tests/sre/2002/index.html, Retrieved: July 2012, Subject: NIST 2002 SRE Evaluation Plan

Details

Primary Language

Turkish

Subjects

-

Journal Section

-

Authors

Cemal Hanilçi This is me

Figen Ertaş This is me

Publication Date

April 1, 2013

Submission Date

December 19, 2014

Acceptance Date

-

Published in Issue

Year 2013 Volume: 18 Number: 1

APA
Hanilçi, C., & Ertaş, F. (2013). Effects Of Background Data Duration On Speaker Verification Performance. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi, 18(1), 111-119. https://doi.org/10.17482/uujfe.97355
AMA
1.Hanilçi C, Ertaş F. Effects Of Background Data Duration On Speaker Verification Performance. UUJFE. 2013;18(1):111-119. doi:10.17482/uujfe.97355
Chicago
Hanilçi, Cemal, and Figen Ertaş. 2013. “Effects Of Background Data Duration On Speaker Verification Performance”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 18 (1): 111-19. https://doi.org/10.17482/uujfe.97355.
EndNote
Hanilçi C, Ertaş F (April 1, 2013) Effects Of Background Data Duration On Speaker Verification Performance. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 18 1 111–119.
IEEE
[1]C. Hanilçi and F. Ertaş, “Effects Of Background Data Duration On Speaker Verification Performance”, UUJFE, vol. 18, no. 1, pp. 111–119, Apr. 2013, doi: 10.17482/uujfe.97355.
ISNAD
Hanilçi, Cemal - Ertaş, Figen. “Effects Of Background Data Duration On Speaker Verification Performance”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 18/1 (April 1, 2013): 111-119. https://doi.org/10.17482/uujfe.97355.
JAMA
1.Hanilçi C, Ertaş F. Effects Of Background Data Duration On Speaker Verification Performance. UUJFE. 2013;18:111–119.
MLA
Hanilçi, Cemal, and Figen Ertaş. “Effects Of Background Data Duration On Speaker Verification Performance”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi, vol. 18, no. 1, Apr. 2013, pp. 111-9, doi:10.17482/uujfe.97355.
Vancouver
1.Cemal Hanilçi, Figen Ertaş. Effects Of Background Data Duration On Speaker Verification Performance. UUJFE. 2013 Apr. 1;18(1):111-9. doi:10.17482/uujfe.97355

Announcements:

30.03.2021-Beginning with our April 2021 (26/1) issue, in accordance with the new criteria of TR-Dizin, the Declaration of Conflict of Interest and the Declaration of Author Contribution forms fulfilled and signed by all authors are required as well as the Copyright form during the initial submission of the manuscript. Furthermore two new sections, i.e. ‘Conflict of Interest’ and ‘Author Contribution’, should be added to the manuscript. Links of those forms that should be submitted with the initial manuscript can be found in our 'Author Guidelines' and 'Submission Procedure' pages. The manuscript template is also updated. For articles reviewed and accepted for publication in our 2021 and ongoing issues and for articles currently under review process, those forms should also be fulfilled, signed and uploaded to the system by authors.