Comparison of Wavelet Based Feature Extraction Methods for Speech/Music Discrimination
Abstract
Speech/music discrimination systems have been gaining importance in intelligent audio retrieval algorithms owing to the growing volume of multimedia sources in daily life. This study proposes a speech/music discrimination system that exploits the advantages of the wavelet transform. In addition, the performance of the discrete wavelet transform and the dual-tree wavelet transform is compared with that of the conventional time, frequency and cepstral domain features used in speech/music discrimination. Speech and music samples collected from common databases, CD recordings and internet radio stations were classified by artificial neural networks using different feature sets. Principal component analysis was applied to eliminate correlated features before the classification stage. Considering the number of vanishing moments and orthogonality, the best performance among the members of the Daubechies family was obtained with the Daubechies8 wavelet. According to the results, the proposed feature set outperforms the traditional ones.
Keywords: Speech/music discrimination, Discrete wavelet transform, Dual-tree wavelet transform, Daubechies mother wavelet.
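As an illustration of the kind of wavelet-based feature the abstract refers to, the following minimal sketch computes relative sub-band energies from a multilevel discrete wavelet transform. It is not the authors' implementation. Several choices are assumptions made only for this example: the 4-tap Daubechies-2 filter is used as a stand-in for the longer Daubechies8 filter, the signal is extended periodically, and the function names `dwt_step` and `subband_energies` are hypothetical. The principal component analysis and neural network stages are not shown.

```python
import math

# Assumption: Daubechies-2 (4-tap) low-pass filter, used here as a short
# stand-in for the Daubechies8 filter named in the abstract.
SQRT3 = math.sqrt(3.0)
LOW = [c / (4.0 * math.sqrt(2.0))
       for c in (1 + SQRT3, 3 + SQRT3, 3 - SQRT3, 1 - SQRT3)]
# Quadrature mirror high-pass filter: g[k] = (-1)^k * h[N-1-k]
HIGH = [((-1) ** k) * LOW[len(LOW) - 1 - k] for k in range(len(LOW))]


def dwt_step(x):
    """One level of a periodic (circular) DWT; returns (approx, detail).

    len(x) is assumed to be even and at least the filter length.
    """
    n = len(x)
    approx, detail = [], []
    for i in range(0, n, 2):  # filter, then keep every second sample
        approx.append(sum(LOW[k] * x[(i + k) % n] for k in range(len(LOW))))
        detail.append(sum(HIGH[k] * x[(i + k) % n] for k in range(len(HIGH))))
    return approx, detail


def subband_energies(x, levels):
    """Relative energy of each detail band (finest first), then the
    final approximation band. The values sum to 1 for a non-zero signal."""
    bands = []
    approx = list(x)
    for _ in range(levels):
        approx, detail = dwt_step(approx)
        bands.append(detail)
    bands.append(approx)
    energies = [sum(v * v for v in band) for band in bands]
    total = sum(energies) or 1.0  # avoid division by zero on silence
    return [e / total for e in energies]


if __name__ == "__main__":
    # Hypothetical frame: a low-frequency sinusoid, 256 samples long.
    frame = [math.sin(2 * math.pi * 5 * n / 256) for n in range(256)]
    print(subband_energies(frame, levels=3))
```

A feature vector of this form could be computed per audio frame and passed to a later decorrelation and classification stage. The decomposition depth and frame length used above are illustrative and are not taken from the study.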
Details
Primary Language
English
Publication Date
March 28, 2012
Submission Date
March 28, 2012
Published in Issue
Year 2011 Volume: 11 Number: 1
APA
Düzenli, T., & Özkurt, N. (2012). Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination. IU-Journal of Electrical & Electronics Engineering, 11(1), 1355-1362. https://izlik.org/JA35FP32MZ