Ambient sound analysis has become more prominent with the rise of portable and wearable devices. It provides valuable insights into a person's environment by analyzing surrounding sounds. Recently, deep learning methods, frequently used in image and text processing, have been applied to this field and are proving more effective than traditional machine learning techniques.
In this study, we evaluated the performance of different deep learning models using mel-spectrograms of 3 classes of stage sounds based on TAU Acoustic Scene 2023 dataset. Our results indicate that a simple Convolutional Neural Network (CNN) model gives better classification results compared to other more complex models in classification tasks. Despite having the fewest parameters, the CNN model achieved the highest success with 59% accuracy. This suggests that simpler models can be highly effective for acoustic scene classification, highlighting the value of more efficient and computationally feasible approaches in this domain
Signal processing deep learning acoustic scene classification audio processing model performance
Birincil Dil | İngilizce |
---|---|
Konular | Sinyal İşleme |
Bölüm | Tasarım ve Teknoloji |
Yazarlar | |
Erken Görünüm Tarihi | 2 Temmuz 2025 |
Yayımlanma Tarihi | |
Gönderilme Tarihi | 15 Kasım 2024 |
Kabul Tarihi | 4 Haziran 2025 |
Yayımlandığı Sayı | Yıl 2025 Cilt: 13 Sayı: 3 |