Ambient sound analysis has become more prominent with the rise of portable and wearable devices. It provides valuable insights into a person's environment by analyzing surrounding sounds. Recently, deep learning methods, frequently used in image and text processing, have been applied to this field and are proving more effective than traditional machine learning techniques.
In this study, we evaluated the performance of different deep learning models using mel-spectrograms of 3 classes of stage sounds based on TAU Acoustic Scene 2023 dataset. Our results indicate that a simple Convolutional Neural Network (CNN) model gives better classification results compared to other more complex models in classification tasks. Despite having the fewest parameters, the CNN model achieved the highest success with 59% accuracy. This suggests that simpler models can be highly effective for acoustic scene classification, highlighting the value of more efficient and computationally feasible approaches in this domain
Signal processing deep learning acoustic scene classification audio processing model performance
Primary Language | English |
---|---|
Subjects | Signal Processing |
Journal Section | Tasarım ve Teknoloji |
Authors | |
Early Pub Date | July 2, 2025 |
Publication Date | September 30, 2025 |
Submission Date | November 15, 2024 |
Acceptance Date | June 4, 2025 |
Published in Issue | Year 2025 Volume: 13 Issue: 3 |