KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ

Mehmet Erdal Özbek

doi:10.17482/uumfd.912949

Araştırma Makalesi

Restoration of Clipped Audio Signals

Yıl 2021, Cilt: 26 Sayı: 3, 1035 - 1046, 31.12.2021

Mehmet Erdal Özbek

https://doi.org/10.17482/uumfd.912949

Öz

Restoration process is performed to remove degradations formed on the audio signals. In the restoration of clipped audio signals, which is one of these degradations, the degraded section is aimed to be restored to its original by the part of the undegraded section of the signal. The transformation of the signal from as normally given or recorded in the time domain to a different domain and thus reducing the number of samples required to be represented might be possible due to sparse representation. In this study, a restoration method is presented that relies on sparse representation of the discrete Fourier transform coefficients of the signal. In order to evaluate the performance of the proposed method, experiments were performed on various speech and music signal examples. It has been shown that the proposed method achieves better signal to noise ratio performance compared to the other methods in cases of higher clipping ratios.

Anahtar Kelimeler

Audio restoration, Clipping, Sparse representation, Optimization

Kaynakça

1. Abel, J. S. and Smith, J. O. (1991) Restoring a clipped signal, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toronto, 1745-1748. doi: 10.1109/ICASSP.1991.150655
2. Adler, A., Emiya, V., Jafari, M. G., Elad, M., Gribonval, R. and Plumbley, M. D. (2012) Audio inpainting, IEEE Transactions on Audio, Speech, and Language Processing, 20(3), 922-932. doi: 10.1109/TASL.2011.2168211
3. Candes, E. J. (2006) Compressive sampling, International Congress of Mathematicians, Madrid, 1433-1452. doi: 10.4171/022-3/69
4. Chen, S., Donoho, D. and Saunders, M. (1998) Atomic decomposition by basis pursuit, SIAM Journal on Scientific Computing, 20(1), 33-61. doi: 10.1137/S1064827596304010
5. Davis, G., Mallat, S. and Avellaneda, M. (1997) Adaptive greedy approximations, Journal of Constructive Approximation, 13, 57-98. doi: 10.1007/BF02678430
6. Defraene, B., Mansour, N., De Hertogh, S., van Waterschoot, T., Diehl, M. and Moonen, M. (2013) Declipping of audio signals using perceptual compressed sensing, IEEE Transactions on Audio, Speech, and Language Processing, 21(12), 2627-2637. doi: 10.1109/TASL.2013.2281570
7. Deruty, E. and Tardieu, D. (2014) About dynamic processing in mainstream music, Journal of Audio Engineering Society, 62(1/2), 42-55. doi: 10.17743/jaes.2014.0001
8. Donoho, D. (2006) Compressed sensing, IEEE Transactions on Information Theory, 52(4), 1289-1306. doi: 10.1109/TIT.2006.871582
9. Gaultier, C., Kitić, S., Gribonval, R. and Bertin, N. (2021) Sparsity-based audio declipping methods: Selected overview, new algorithms, and large-scale evaluation, IEEE/ACM Transactions on Audio, Speech and Language Processing, 29, 1174-1187. doi: 10.1109/TASLP.2021.3059264
10. Godsill, S. J. and Rayner, P. J. W. (1998) Digital Audio Restoration - A Statistical Model-Based Approach. London: Springer-Verlag. doi: 10.1007/978-1-4471-1561-8
11. Janssen, A. J. E. M., Veldhuis, R. N. J. and Vries, L. B. (1986) Adaptive interpolation of discrete-time signals that can be modeled as autoregressive processes, IEEE Transactions on Acoustics, Speech, and Signal Processing, 34(2), 317-330. doi: 10.1109/TASSP.1986.1164824
12. Kitić S., Bertin N. and Gribonval R. (2015) Sparsity and cosparsity for audio declipping: A flexible non-convex approach. in: Vincent E., Yeredor A., Koldovský Z., Tichavský P. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2015. Lecture Notes in Computer Science, 9237, 243-250. doi: 10.1007/978-3-319- 22482-4_28
13. Mallat, S. and Zhang, Z. (1993) Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, 41(12), 3397-3415. doi: 10.1109/78.258082
14. Mokrý, O. and Rajmic, P. (2020) Audio inpainting: Revisited and reweighted, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 2906-2918. doi: 10.1109/TASLP.2020.3030486
15. Mokrý, O., Záviška, P., Rajmic, P. and Veselý, V. (2019) Introducing SPAIN (SParse Audio INpainter), 27th European Signal Processing Conference (EUSIPCO), A Coruna, 1-5. doi: 10.23919/EUSIPCO.2019.8902560
16. Needell, D. (2009) Noisy signal recovery via iterative reweighted L1-minimization, Forty-Third Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 113-117. doi: 10.1109/ACSSC.2009.5470154
17. Orcalli, A. (2001) On the methodologies of audio restoration, Journal of New Music Research, 30(4), 307-322. doi: 10.1076/jnmr.30.4.307.7496
18. Ozerov, A., Bilen, Ç. and Pérez, P. (2016) Multichannel audio declipping, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 659-663. doi: 10.1109/ICASSP.2016.7471757
19. Plumbley, M. D., Blumensath, T., Daudet, L., Gribonval, R. and Davies, M. E. (2010) Sparse representations in audio and music: From coding to source separation, Proceedings of the IEEE, 98(6), 995-1005. doi: 10.1109/JPROC.2009.2030345
20. Tauböck, G., Rajbamshi, S. and Balazs, P. (2021) Dictionary learning for sparse audio inpainting, IEEE Journal of Selected Topics in Signal Processing, 15(1), 104-119. doi: 10.1109/JSTSP.2020.3046422
21. Weinstein, A. J. and Wakin, M. B. (2011) Recovering a clipped signal in sparseland. Erişim Adresi: https://arxiv.org/pdf/1110.5063 (Erişim Tarihi: 02.09.2021).
22. Záviška, P., Rajmic, P., Ozerov, A. and Rencker, L. (2021) A survey and an extensive evaluation of popular audio declipping methods, IEEE Journal of Selected Topics in Signal Processing, 15(1), 5-24. doi: 10.1109/JSTSP.2020.3042071

KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ

Yıl 2021, Cilt: 26 Sayı: 3, 1035 - 1046, 31.12.2021

Mehmet Erdal Özbek

https://doi.org/10.17482/uumfd.912949

Öz

Ses işaretlerinde oluşan bozulmaların ortadan kaldırılması için yenileme işlemi yapılmaktadır. Bu bozulmalardan birisi olan kırpılmış ses işaretlerinin yenileme işleminde, işaretin bozulmamış bölgesindeki işaret parçası aracılığı ile işaretin bozulmaya uğramış bölgesinin özgün durumuna geri getirilmesi amaçlanmaktadır. İşaretin normal olarak verildiği ya da kayıt edildiği zaman ortamından farklı bir ortama dönüştürülmesi ve bu sayede temsil edilmesi için gerekli örnek sayısının azalması seyrek gösterim sayesinde mümkün olmaktadır. Bu çalışmada işaretin ayrık Fourier dönüşümü katsayılarının oluşturduğu seyrek gösterime dayanan bir yenileme yöntemi sunulmaktadır. Önerilen yöntemin başarımının değerlendirilmesi için farklı konuşma ve müzik işaretlerinden oluşan örnekler üzerinde çalışmalar yapılmıştır. Önerilen yöntemin işaretin daha yüksek oranda kırpılması durumunda karşılaştırılan diğer yöntemlere göre daha iyi işaret gürültü oranı başarımı elde ettiği gösterilmiştir.

Anahtar Kelimeler

Ses yenileme, Kırpılma, Seyrek gösterim, Eniyileme

Kaynakça

1. Abel, J. S. and Smith, J. O. (1991) Restoring a clipped signal, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toronto, 1745-1748. doi: 10.1109/ICASSP.1991.150655
2. Adler, A., Emiya, V., Jafari, M. G., Elad, M., Gribonval, R. and Plumbley, M. D. (2012) Audio inpainting, IEEE Transactions on Audio, Speech, and Language Processing, 20(3), 922-932. doi: 10.1109/TASL.2011.2168211
3. Candes, E. J. (2006) Compressive sampling, International Congress of Mathematicians, Madrid, 1433-1452. doi: 10.4171/022-3/69
4. Chen, S., Donoho, D. and Saunders, M. (1998) Atomic decomposition by basis pursuit, SIAM Journal on Scientific Computing, 20(1), 33-61. doi: 10.1137/S1064827596304010
5. Davis, G., Mallat, S. and Avellaneda, M. (1997) Adaptive greedy approximations, Journal of Constructive Approximation, 13, 57-98. doi: 10.1007/BF02678430
6. Defraene, B., Mansour, N., De Hertogh, S., van Waterschoot, T., Diehl, M. and Moonen, M. (2013) Declipping of audio signals using perceptual compressed sensing, IEEE Transactions on Audio, Speech, and Language Processing, 21(12), 2627-2637. doi: 10.1109/TASL.2013.2281570
7. Deruty, E. and Tardieu, D. (2014) About dynamic processing in mainstream music, Journal of Audio Engineering Society, 62(1/2), 42-55. doi: 10.17743/jaes.2014.0001
8. Donoho, D. (2006) Compressed sensing, IEEE Transactions on Information Theory, 52(4), 1289-1306. doi: 10.1109/TIT.2006.871582
9. Gaultier, C., Kitić, S., Gribonval, R. and Bertin, N. (2021) Sparsity-based audio declipping methods: Selected overview, new algorithms, and large-scale evaluation, IEEE/ACM Transactions on Audio, Speech and Language Processing, 29, 1174-1187. doi: 10.1109/TASLP.2021.3059264
10. Godsill, S. J. and Rayner, P. J. W. (1998) Digital Audio Restoration - A Statistical Model-Based Approach. London: Springer-Verlag. doi: 10.1007/978-1-4471-1561-8
11. Janssen, A. J. E. M., Veldhuis, R. N. J. and Vries, L. B. (1986) Adaptive interpolation of discrete-time signals that can be modeled as autoregressive processes, IEEE Transactions on Acoustics, Speech, and Signal Processing, 34(2), 317-330. doi: 10.1109/TASSP.1986.1164824
12. Kitić S., Bertin N. and Gribonval R. (2015) Sparsity and cosparsity for audio declipping: A flexible non-convex approach. in: Vincent E., Yeredor A., Koldovský Z., Tichavský P. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2015. Lecture Notes in Computer Science, 9237, 243-250. doi: 10.1007/978-3-319- 22482-4_28
13. Mallat, S. and Zhang, Z. (1993) Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, 41(12), 3397-3415. doi: 10.1109/78.258082
14. Mokrý, O. and Rajmic, P. (2020) Audio inpainting: Revisited and reweighted, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 2906-2918. doi: 10.1109/TASLP.2020.3030486
15. Mokrý, O., Záviška, P., Rajmic, P. and Veselý, V. (2019) Introducing SPAIN (SParse Audio INpainter), 27th European Signal Processing Conference (EUSIPCO), A Coruna, 1-5. doi: 10.23919/EUSIPCO.2019.8902560
16. Needell, D. (2009) Noisy signal recovery via iterative reweighted L1-minimization, Forty-Third Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 113-117. doi: 10.1109/ACSSC.2009.5470154
17. Orcalli, A. (2001) On the methodologies of audio restoration, Journal of New Music Research, 30(4), 307-322. doi: 10.1076/jnmr.30.4.307.7496
18. Ozerov, A., Bilen, Ç. and Pérez, P. (2016) Multichannel audio declipping, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 659-663. doi: 10.1109/ICASSP.2016.7471757
19. Plumbley, M. D., Blumensath, T., Daudet, L., Gribonval, R. and Davies, M. E. (2010) Sparse representations in audio and music: From coding to source separation, Proceedings of the IEEE, 98(6), 995-1005. doi: 10.1109/JPROC.2009.2030345
20. Tauböck, G., Rajbamshi, S. and Balazs, P. (2021) Dictionary learning for sparse audio inpainting, IEEE Journal of Selected Topics in Signal Processing, 15(1), 104-119. doi: 10.1109/JSTSP.2020.3046422
21. Weinstein, A. J. and Wakin, M. B. (2011) Recovering a clipped signal in sparseland. Erişim Adresi: https://arxiv.org/pdf/1110.5063 (Erişim Tarihi: 02.09.2021).
22. Záviška, P., Rajmic, P., Ozerov, A. and Rencker, L. (2021) A survey and an extensive evaluation of popular audio declipping methods, IEEE Journal of Selected Topics in Signal Processing, 15(1), 5-24. doi: 10.1109/JSTSP.2020.3042071

Toplam 22 adet kaynakça vardır.

Ayrıntılar

Birincil Dil	Türkçe
Konular	Elektrik Mühendisliği
Bölüm	Araştırma Makaleleri
Yazarlar	Mehmet Erdal Özbek 0000-0001-5840-7960
Yayımlanma Tarihi	31 Aralık 2021
Gönderilme Tarihi	10 Nisan 2021
Kabul Tarihi	13 Kasım 2021
Yayımlandığı Sayı	Yıl 2021 Cilt: 26 Sayı: 3

Kaynak Göster

APA	Özbek, M. E. (2021). KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi, 26(3), 1035-1046. https://doi.org/10.17482/uumfd.912949
AMA	Özbek ME. KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ. UUJFE. Aralık 2021;26(3):1035-1046. doi:10.17482/uumfd.912949
Chicago	Özbek, Mehmet Erdal. “KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 26, sy. 3 (Aralık 2021): 1035-46. https://doi.org/10.17482/uumfd.912949.
EndNote	Özbek ME (01 Aralık 2021) KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 26 3 1035–1046.
IEEE	M. E. Özbek, “KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ”, UUJFE, c. 26, sy. 3, ss. 1035–1046, 2021, doi: 10.17482/uumfd.912949.
ISNAD	Özbek, Mehmet Erdal. “KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 26/3 (Aralık 2021), 1035-1046. https://doi.org/10.17482/uumfd.912949.
JAMA	Özbek ME. KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ. UUJFE. 2021;26:1035–1046.
MLA	Özbek, Mehmet Erdal. “KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi, c. 26, sy. 3, 2021, ss. 1035-46, doi:10.17482/uumfd.912949.
Vancouver	Özbek ME. KIRPILMIŞ SES İŞARETLERİNİN YENİLENMESİ. UUJFE. 2021;26(3):1035-46.

Makale Dosyaları

Tam Metin

DUYURU:

30.03.2021- Nisan 2021 (26/1) sayımızdan itibaren TR-Dizin yeni kuralları gereği, dergimizde basılacak makalelerde, ilk gönderim aşamasında Telif Hakkı Formu yanısıra, Çıkar Çatışması Bildirim Formu ve Yazar Katkısı Bildirim Formu da tüm yazarlarca imzalanarak gönderilmelidir. Yayınlanacak makalelerde de makale metni içinde "Çıkar Çatışması" ve "Yazar Katkısı" bölümleri yer alacaktır. İlk gönderim aşamasında doldurulması gereken yeni formlara "Yazım Kuralları" ve "Makale Gönderim Süreci" sayfalarımızdan ulaşılabilir. (Değerlendirme süreci bu tarihten önce tamamlanıp basımı bekleyen makalelerin yanısıra değerlendirme süreci devam eden makaleler için, yazarlar tarafından ilgili formlar doldurularak sisteme yüklenmelidir). Makale şablonları da, bu değişiklik doğrultusunda güncellenmiştir. Tüm yazarlarımıza önemle duyurulur.

Bursa Uludağ Üniversitesi, Mühendislik Fakültesi Dekanlığı, Görükle Kampüsü, Nilüfer, 16059 Bursa. Tel: (224) 294 1907, Faks: (224) 294 1903, e-posta: mmfd@uludag.edu.tr