Research Article

Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method

Volume: 36 Number: 2 June 1, 2023
EN

Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method

Abstract

Audio search algorithms are used to detect the queried file in large databases, especially in multimedia applications. These algorithms are expected to perform the detection in a reliable and robust way within the shortest time. In this study, based on spectral peaks method, an audio fingerprint algorithm with a few minor modifications was developed to detect the matching audio file in target database. This method has two stages as the audio fingerprint extraction and matching. In the first stage, fingerprint features are extracted from spectral peaks on the spectrograms of audio files by hash functions. This state-of-art technique reduces the processing load and time considerably compared to traditional methods. In the second stage, fingerprint data of the queried file are compared with the data created in the first stage in the database. The algorithm was demonstrated, and the effect of spectrogram parameters (window size, overlap, number of FFT) was investigated by considering reliability and robustness under different noise sources. Also, it was aimed to contribute to new audio retrieval studies based on spectral peaks method. It was observed that the variation in the spectrogram parameters significantly affected the number of matchings, reliability and robustness. Under high noise conditions, the optimal spectrogram parameters were determined as 512 (window size), 50% (overlap), 512 (number of FFT). It was seen in general that the algorithm successfully detected the queried file in the database even in high noise conditions for these parameters. No significant effect of music genre was observed.  

Keywords

References

  1. [1] Grosche, P., Müller, M., Serra, J., "Audio Content-Based Music Retrieval", in: M. Müller, M. Goto, M. Schedl (Eds.), Multimodal Music Processing, Dagstuhl Follow-Ups, 157–174, (2012).
  2. [2] Casey, M.A., Veltkamp, R., Goto, M., Leman, M., Rhodes, C., Slaney, M., "Content-Based Music Information Retrieval: Current Directions and Future Challenges", Proceedings of the IEEE, 96 (4): 668–696, (2008).
  3. [3] Cano, P., Batlle, E., Kalker, T., Haitsma, J., "A Review of Audio Fingerprinting", Journal of VLSI Signal Processing Systems for Signal, Image and Video Technology, 41(3): 271–284, (2005).
  4. [4] Cano, P., Battle, E., Mayer, H., Neuschmied, H., "Robust Sound Modeling for Song Detection in Broadcast Audio", AES 112th Convention, Munich, 1–7, (2002).
  5. [5] Haitsma, J., Kalker, T., "A Highly Robust Audio Fingerprinting System", International Conference on Music Information Retrieval, Paris, 1–9, (2002).
  6. [6] Haitsma, J., Kalker, T., "Speed-change resistant audio fingerprinting using auto-correlation", IEEE International Conference on Acoustics, Speech, and Signal Processing, IV-728–31, (2003).
  7. [7] Cremer, M., Froba, B., Hellmuth, O., Herre, J., Allamanche, E., "AudioID: Towards Content-Based Identification of Audio Material", AES 110th Convention, Amsterdam, (2001).
  8. [8] Fenet, S., Richard, G., Grenier, Y., "A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting", 12th International Society for Music Information Retrieval Conference, Miami, 121–126, (2011).

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

June 1, 2023

Submission Date

September 25, 2021

Acceptance Date

May 27, 2022

Published in Issue

Year 2023 Volume: 36 Number: 2

APA
Köseoğlu, M., & Uyanık, H. (2023). Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method. Gazi University Journal of Science, 36(2), 624-643. https://doi.org/10.35378/gujs.1000594
AMA
1.Köseoğlu M, Uyanık H. Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method. Gazi University Journal of Science. 2023;36(2):624-643. doi:10.35378/gujs.1000594
Chicago
Köseoğlu, Murat, and Hakan Uyanık. 2023. “Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-Temporal Peaks Based Audio Search Method”. Gazi University Journal of Science 36 (2): 624-43. https://doi.org/10.35378/gujs.1000594.
EndNote
Köseoğlu M, Uyanık H (June 1, 2023) Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method. Gazi University Journal of Science 36 2 624–643.
IEEE
[1]M. Köseoğlu and H. Uyanık, “Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method”, Gazi University Journal of Science, vol. 36, no. 2, pp. 624–643, June 2023, doi: 10.35378/gujs.1000594.
ISNAD
Köseoğlu, Murat - Uyanık, Hakan. “Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-Temporal Peaks Based Audio Search Method”. Gazi University Journal of Science 36/2 (June 1, 2023): 624-643. https://doi.org/10.35378/gujs.1000594.
JAMA
1.Köseoğlu M, Uyanık H. Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method. Gazi University Journal of Science. 2023;36:624–643.
MLA
Köseoğlu, Murat, and Hakan Uyanık. “Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-Temporal Peaks Based Audio Search Method”. Gazi University Journal of Science, vol. 36, no. 2, June 2023, pp. 624-43, doi:10.35378/gujs.1000594.
Vancouver
1.Murat Köseoğlu, Hakan Uyanık. Effect of Spectrogram Parameters and Noise Types on The Performance of Spectro-temporal Peaks Based Audio Search Method. Gazi University Journal of Science. 2023 Jun. 1;36(2):624-43. doi:10.35378/gujs.1000594

Cited By