Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination

Volume: 11 Number: 1 March 28, 2012
EN

Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination

Abstract

The speech/music discrimination systems have gaining importance in several intelligent audio retrieval algorithms due to the increasing size of the multimedia sources in our daily lives. This study aims to propose a speech/music discrimination system which utilizes the advantages of the wavelet transform. Also, the performance of the discrete wavelet transform and the dual- tree wavelet transform has been compared with the conventional time, frequency and cepstral domain features used in speech/music discrimination. The speech and music samples collected from common databases, CD recording and internet radios have been classified with artificial neural networks with different feature sets. The principal component analysis has been applied to eliminate the correlated features before classification stage. Considering the number of vanishing moments and orthogonality, the best performance has been obtained with Daubechies8 wavelet among the other members of the Daubechies family. According to the results, the proposed feature set outperforms the traditional ones.
Keywords: Speech/music discrimination, Discrete wavelet transform, Dual-tree wavelet transform, Daubechies mother wavelet.

Keywords

References

  1. Ambikairajah, O. M. E., Epps, J., “Novel features for effective speech and music discrimination,” in Proc. IEEE Int. Conf. on Engineering of Intelligent Systems, pp. 1–5, 2006.
  2. Exposito, N. R. J.E.M., Galan, S.G., Candeas, P., “Audio coding improvement using evolutionary speech/music discrimination,” in Proc. IEEE Int. Conf. on Fuzzy Systems (FUZZ-IEEE), pp. 1–6, 2007.
  3. El-Maleh, K., Petrucci, M. G., Kabal, P., “Speech/music discrimination for multimedia applications,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 2445–2448, 2000.
  4. Gedik, A., Bozkurt, B., “Pitch frequency histogram based music information retrieval for turkish music,” Signal Processing, vol. 10, pp. 1049–1063, 2010.
  5. Saunders, J., “Real time discrimination of broadcast speech/music,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 993–996, 1996.
  6. Scheier, E., Slaney, M., “Construction and evaluation of a robust multifeature speech/music discriminator,” in Proc. IEEE Int. Conf. On Acoustics, Speech, and Signal Processing, ICASSP’97, pp. 1331–1334, 1997
  7. Ajmera, I. M. J., Bourlard, H., “Speech/music segmentation using entropy and dynamism features in a HMM classification framework,” Speech Communication, vol. 40, pp. 351–363, 2003.
  8. Panagiotakis, C., Tziritas, G., “A speech/music discriminator based on RMS and zero-crossings,” IEEE Trans. Multimedia, vol. 7, pp. 155–166, 2005.

Details

Primary Language

English

Subjects

-

Journal Section

-

Authors

Timur Düzenli This is me

Publication Date

March 28, 2012

Submission Date

March 28, 2012

Acceptance Date

-

Published in Issue

Year 2011 Volume: 11 Number: 1

APA
Düzenli, T., & Özkurt, N. (2012). Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination. IU-Journal of Electrical & Electronics Engineering, 11(1), 1355-1362. https://izlik.org/JA35FP32MZ
AMA
1.Düzenli T, Özkurt N. Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination. IU-Journal of Electrical & Electronics Engineering. 2012;11(1):1355-1362. https://izlik.org/JA35FP32MZ
Chicago
Düzenli, Timur, and Nalan Özkurt. 2012. “Comparison OF Wavelet Based Feature Extraction Methods for Speech Music Discrimination”. IU-Journal of Electrical & Electronics Engineering 11 (1): 1355-62. https://izlik.org/JA35FP32MZ.
EndNote
Düzenli T, Özkurt N (March 1, 2012) Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination. IU-Journal of Electrical & Electronics Engineering 11 1 1355–1362.
IEEE
[1]T. Düzenli and N. Özkurt, “Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination”, IU-Journal of Electrical & Electronics Engineering, vol. 11, no. 1, pp. 1355–1362, Mar. 2012, [Online]. Available: https://izlik.org/JA35FP32MZ
ISNAD
Düzenli, Timur - Özkurt, Nalan. “Comparison OF Wavelet Based Feature Extraction Methods for Speech Music Discrimination”. IU-Journal of Electrical & Electronics Engineering 11/1 (March 1, 2012): 1355-1362. https://izlik.org/JA35FP32MZ.
JAMA
1.Düzenli T, Özkurt N. Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination. IU-Journal of Electrical & Electronics Engineering. 2012;11:1355–1362.
MLA
Düzenli, Timur, and Nalan Özkurt. “Comparison OF Wavelet Based Feature Extraction Methods for Speech Music Discrimination”. IU-Journal of Electrical & Electronics Engineering, vol. 11, no. 1, Mar. 2012, pp. 1355-62, https://izlik.org/JA35FP32MZ.
Vancouver
1.Timur Düzenli, Nalan Özkurt. Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination. IU-Journal of Electrical & Electronics Engineering [Internet]. 2012 Mar. 1;11(1):1355-62. Available from: https://izlik.org/JA35FP32MZ