EN
Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder
Öz
Audio spoof detection gained attention of the researchers recently, as it is vital to detect spoofed speech for automatic speaker recognition systems. Publicly available datasets also accelerated the studies in this area. Many different features and classifiers have been proposed to overcome the spoofed speech detection problem, and some of them achieved considerably high performances. However, under additive noise, the spoof detection performance drops rapidly. On the other hand, number of studies about robust spoofed speech detection is very limited. The problem becomes more interesting as the conventional speech enhancement methods reportedly performed worse than no enhancement. In this work, i-vectors are used for spoof detection, and discriminative denoising autoencoder (DAE) network is used to obtain enhanced (clean) i-vectors from their noisy counterparts. Once the enhanced i-vectors are obtained, they can be treated as normal i-vectors and can be scored/classified without any modifications in the classifier part. Data from ASVspoof 2015 challenge is used with five different additive noise types, following a similar configuration of previous studies. The DAE is trained with a multicondition manner, using both clean and corrupted i-vectors. Three different noise types at various signal-to-noise ratios are used to create corrupted i-vectors, and two different noise types are used only in the test stage to simulate unknown noise conditions. Experimental results showed that the proposed DAE approach is more effective than the conventional speech enhancement methods.
Anahtar Kelimeler
Destekleyen Kurum
Tübitak
Proje Numarası
121E057
Teşekkür
This work was supported by TUBITAK under project number 121E057.
Kaynakça
- [1] Z. Wu, E. S. Chng, and H. Li, “Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition,” in 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 2012, vol. 2, pp. 1698–1701.
- [2] A. Nautsch et al., “ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech,” IEEE Trans. Biometrics, Behav. Identity Sci., vol. 3, no. 2, pp. 252–265, Apr. 2021.
- [3] H. Delgado et al., “ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements,” in Odyssey 2018 The Speaker and Language Recognition Workshop, 2018, pp. 296–303.
- [4] Z. Wu et al., “ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge,” IEEE J. Sel. Top. Signal Process., vol. 11, no. 4, pp. 588–604, Jun. 2017.
- [5] M. Todisco, H. Delgado, and N. Evans, “Constant Q cepstral coefficients: A spoofing countermeasure for automatic speaker verification,” Comput. Speech Lang., vol. 45, pp. 516–535, Sep. 2017.
- [6] J. Yang and L. Liu, “Playback speech detection based on magnitude-phase spectrum,” Electron. Lett., vol. 54, no. 14, pp. 901–903, Jul. 2018.
- [7] A. T. Patil, H. A. Patil, and K. Khoria, “Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection,” Comput. Speech Lang., vol. 72, no. 1, p. 101301, Mar. 2022.
- [8] J. Yang, R. K. Das, and N. Zhou, “Extraction of Octave Spectra Information for Spoofing Attack Detection,” IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 27, no. 12, pp. 2373–2384, Dec. 2019.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
29 Haziran 2023
Gönderilme Tarihi
20 Haziran 2022
Kabul Tarihi
8 Haziran 2023
Yayımlandığı Sayı
Yıl 2023 Cilt: 19 Sayı: 2
APA
Dişken, G., & Tüfekci, Z. (2023). Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder. Celal Bayar University Journal of Science, 19(2), 167-174. https://doi.org/10.18466/cbayarfbe.1132319
AMA
1.Dişken G, Tüfekci Z. Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder. Celal Bayar University Journal of Science. 2023;19(2):167-174. doi:10.18466/cbayarfbe.1132319
Chicago
Dişken, Gökay, ve Zekeriya Tüfekci. 2023. “Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder”. Celal Bayar University Journal of Science 19 (2): 167-74. https://doi.org/10.18466/cbayarfbe.1132319.
EndNote
Dişken G, Tüfekci Z (01 Haziran 2023) Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder. Celal Bayar University Journal of Science 19 2 167–174.
IEEE
[1]G. Dişken ve Z. Tüfekci, “Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder”, Celal Bayar University Journal of Science, c. 19, sy 2, ss. 167–174, Haz. 2023, doi: 10.18466/cbayarfbe.1132319.
ISNAD
Dişken, Gökay - Tüfekci, Zekeriya. “Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder”. Celal Bayar University Journal of Science 19/2 (01 Haziran 2023): 167-174. https://doi.org/10.18466/cbayarfbe.1132319.
JAMA
1.Dişken G, Tüfekci Z. Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder. Celal Bayar University Journal of Science. 2023;19:167–174.
MLA
Dişken, Gökay, ve Zekeriya Tüfekci. “Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder”. Celal Bayar University Journal of Science, c. 19, sy 2, Haziran 2023, ss. 167-74, doi:10.18466/cbayarfbe.1132319.
Vancouver
1.Gökay Dişken, Zekeriya Tüfekci. Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder. Celal Bayar University Journal of Science. 01 Haziran 2023;19(2):167-74. doi:10.18466/cbayarfbe.1132319