Research Article

Robust Spoofed Speech Detection with Denoised I-vectors

Volume: 36 Number: 4 December 1, 2023
EN

Robust Spoofed Speech Detection with Denoised I-vectors

Abstract

Spoofed speech detection is recently gaining attention of the researchers as speaker verification is shown to be vulnerable to spoofing attacks such as voice conversion, speech synthesis, replay, and impersonation. Although various different methods have been proposed to detect spoofed speech, their performances decrease dramatically under the mismatched conditions due to the additive or reverberant noises. Conventional speech enhancement methods fail to recover the performance gap, hence more advanced techniques seem to be necessary to solve the noisy spoofed speech detection problem. In this work, Denoising Autoencoder (DAE) is used to obtain clean estimates of i-vectors from their noisy versions. ASVspoof 2015 database is used in the experiments with five different noise types, added to the original utterances at 0, 10, and 20 dB signal-to-noise ratios (SNR). The experimental results verified that the DAE provides a more robust spoof detection, where the conventional methods fail.

Keywords

Supporting Institution

Tübitak

Project Number

121E057

Thanks

This study was supported by TUBITAK under project no. 121E057.

References

  1. [1] Reynolds, D.A., Quatieri, T.F., and Dunn, R.B., “Speaker Verification Using Adapted Gaussian Mixture Models”, Digital Signal Processing, 10: 19–41, (2000).
  2. [2] Dehak, N., Kenny, P.J., Dehak, R., Dumouchel, P., and Ouellet, P., “Front-End Factor Analysis for Speaker Verification”, IEEE Transactions on Audio, Speech, and Language Processing, 19: 788–798, (2011).
  3. [3] Snyder, D., Ghahremani, P., Povey, D., Garcia-Romero, D., Carmiel, Y., and Khudanpur, S., “Deep neural network-based speaker embeddings for end-to-end speaker verification”, in 2016 IEEE Spoken Language Technology Workshop (SLT), 165–170, (2016).
  4. [4] Wu, Z., Chng, E.S., and Li, H., “Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition”, in 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 2: 1698–1701, (2012).
  5. [5] Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T., and King, S., “SAS: A speaker verification spoofing database containing diverse attacks”, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4440–4444, (2015).
  6. [6] Wu, Z., Evans, N., Kinnunen, T., Yamagishi, J., Alegre, F., and Li, H., “Spoofing and countermeasures for speaker verification: A survey”, Speech Communication, 66: 130–153, (2015).
  7. [7] Gales, M.F.J., and Young, S.J., “Robust speech recognition in additive and convolutional noise using parallel model combination”, Computer Speech and Language, 9(4): 289–307, (1995).
  8. [8] Fujimoto, M., and Riki, Y.A., “Robust speech recognition in additive and channel noise environments using GMM and EM algorithm”, in 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1: I-941–4, (2004).

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

December 1, 2023

Submission Date

January 25, 2022

Acceptance Date

October 6, 2022

Published in Issue

Year 2023 Volume: 36 Number: 4

APA
Dişken, G. (2023). Robust Spoofed Speech Detection with Denoised I-vectors. Gazi University Journal of Science, 36(4), 1553-1561. https://doi.org/10.35378/gujs.1062788
AMA
1.Dişken G. Robust Spoofed Speech Detection with Denoised I-vectors. Gazi University Journal of Science. 2023;36(4):1553-1561. doi:10.35378/gujs.1062788
Chicago
Dişken, Gökay. 2023. “Robust Spoofed Speech Detection With Denoised I-Vectors”. Gazi University Journal of Science 36 (4): 1553-61. https://doi.org/10.35378/gujs.1062788.
EndNote
Dişken G (December 1, 2023) Robust Spoofed Speech Detection with Denoised I-vectors. Gazi University Journal of Science 36 4 1553–1561.
IEEE
[1]G. Dişken, “Robust Spoofed Speech Detection with Denoised I-vectors”, Gazi University Journal of Science, vol. 36, no. 4, pp. 1553–1561, Dec. 2023, doi: 10.35378/gujs.1062788.
ISNAD
Dişken, Gökay. “Robust Spoofed Speech Detection With Denoised I-Vectors”. Gazi University Journal of Science 36/4 (December 1, 2023): 1553-1561. https://doi.org/10.35378/gujs.1062788.
JAMA
1.Dişken G. Robust Spoofed Speech Detection with Denoised I-vectors. Gazi University Journal of Science. 2023;36:1553–1561.
MLA
Dişken, Gökay. “Robust Spoofed Speech Detection With Denoised I-Vectors”. Gazi University Journal of Science, vol. 36, no. 4, Dec. 2023, pp. 1553-61, doi:10.35378/gujs.1062788.
Vancouver
1.Gökay Dişken. Robust Spoofed Speech Detection with Denoised I-vectors. Gazi University Journal of Science. 2023 Dec. 1;36(4):1553-61. doi:10.35378/gujs.1062788

Cited By