Araştırma Makalesi

Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network

Cilt: 10 Sayı: 3 30 Temmuz 2022
PDF İndir
EN

Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network

Öz

Semantic segmentation is a fundamental problem for computer vision. On the other hand, for studies in the field of biometrics, semantic segmentation is gaining more importance. Many successful biometric recognition systems require a high- performance semantic segmentation algorithm. In this study, we present an effective ear segmentation technique in natural images. A convolutional neural network is trained for pixel-based ear segmentation. DeepLab v3+ network structure, with ResNet-18 as the backbone and Tversky lost function layer as the last layer, has been trained with natural and uncontrolled images. We perform the proposed network training using only the 750 images in the Annotated Web Ears (AWE) training set. The corresponding tests are performed on the AWE Test Set, University of Ljubljana Test Set, and the Collection A of In-The-Wild dataset. For the Annotated Web Ears (AWE) dataset, intersection over union (IoU) is measured as 86.3% for the AWE database. To the best of our knowledge, this is the highest performance achieved among the algorithms tested on the AWE test set.

Anahtar Kelimeler

Kaynakça

  1. [1] A. Abaza, A. Ross, C. Hebert, M. A. F. Harrison, and M. S.Nixon, “A survey on ear biometrics,” ACM Computing Surveys,vol. 45, no. 2, pp. 1–35, Feb. 2013, number: 2 Reporter: ACMComputing Surveys. [Online]. Available: http://dl.acm.org/citation.cfm?doid=2431211.2431221
  2. [2] A. Pflug and C. Busch, “Ear biometrics: a survey of detection,feature extraction and recognition methods,” IET Biometrics, vol. 1,no. 2, pp. 114–129, Jun. 2012, number: 2 Reporter: IET Biometrics.[Online]. Available: https://digital-library.theiet.org/content/journals/10.1049/iet-bmt.2011.0003
  3. [3] Z. Emersic, D. Stepec, V. Struc, P. Peer, A. George, A. Ahmad, E. Omar,T. E. Boult, R. Safdaii, Y. Zhou, S. Zafeiriou, D. Yaman, F. I. Eyiokur,and H. K. Ekenel, “The unconstrained ear recognition challenge,” in2017 IEEE International Joint Conference on Biometrics (IJCB), Oct.2017, pp. 715–724, meeting Name: 2017 IEEE International JointConference on Biometrics (IJCB) Reporter: 2017 IEEE InternationalJoint Conference on Biometrics (IJCB) ISSN: 2474-9699.
  4. [4] Z. Emersic, A. K. S. V, B. S. Harish, W. Gutfeter, J. N. Khiarak,A. Pacut, E. Hansley, M. P. Segundo, S. Sarkar, H. J. Park, G. P. Nam, I.-J. Kim, S. G. Sangodkar, U. Kacar, M. Kirci, L. Yuan, J. Yuan, H. Zhao,F. Lu, J. Mao, X. Zhang, D. Yaman, F. I. Eyiokur, K. B. Özler, H. K.Ekenel, D. P. Chowdhury, S. Bakshi, P. K. Sa, B. Majhi, P. Peer, andV. Štruc, “The Unconstrained Ear Recognition Challenge 2019,” in 2019International Conference on Biometrics (ICB), 2019, pp. 1–15.
  5. [5] Z. Emersic, J. Krizaj, V. Struc, and P. Peer, “Deep Ear RecognitionPipeline,” in Recent Advances in Computer Vision: Theories andApplications, ser. Studies in Computational Intelligence, M. Hassaballahand K. M. Hosny, Eds. Cham: Springer International Publishing,2019, pp. 333–362, reporter: Recent Advances in Computer Vision:Theories and Applications. [Online]. Available: https://doi.org/10.1007/978-3-030-03000-1 14
  6. [6] Z. Zou, Z. Shi, Y. Guo, and J. Ye, “Object Detection in 20Years: A Survey,” arXiv:1905.05055 [cs], May 2019, reporter:arXiv:1905.05055 [cs] arXiv: 1905.05055. [Online]. Available: http://arxiv.org/abs/1905.05055
  7. [7] A. Paszke, A. Chaurasia, S. Kim, and E. Culurciello, “ENet: A DeepNeural Network Architecture for Real-Time Semantic Segmentation,”arXiv:1606.02147 [cs], Jun. 2016, reporter: arXiv:1606.02147 [cs]arXiv: 1606.02147. [Online]. Available: http://arxiv.org/abs/1606.02147
  8. [8] L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L.Yuille, “Semantic Image Segmentation with Deep ConvolutionalNets and Fully Connected CRFs,” arXiv:1412.7062 [cs], Jun. 2016,reporter: arXiv:1412.7062 [cs] arXiv: 1412.7062. [Online]. Available:http://arxiv.org/abs/1412.7062

Ayrıntılar

Birincil Dil

İngilizce

Konular

Yapay Zeka

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

30 Temmuz 2022

Gönderilme Tarihi

15 Kasım 2021

Kabul Tarihi

18 Temmuz 2022

Yayımlandığı Sayı

Yıl 2022 Cilt: 10 Sayı: 3

Kaynak Göster

APA
Inan, T., & Kacar, U. (2022). Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering, 10(3), 337-346. https://doi.org/10.17694/bajece.1024073
AMA
1.Inan T, Kacar U. Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering. 2022;10(3):337-346. doi:10.17694/bajece.1024073
Chicago
Inan, Tolga, ve Umit Kacar. 2022. “Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network”. Balkan Journal of Electrical and Computer Engineering 10 (3): 337-46. https://doi.org/10.17694/bajece.1024073.
EndNote
Inan T, Kacar U (01 Temmuz 2022) Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering 10 3 337–346.
IEEE
[1]T. Inan ve U. Kacar, “Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network”, Balkan Journal of Electrical and Computer Engineering, c. 10, sy 3, ss. 337–346, Tem. 2022, doi: 10.17694/bajece.1024073.
ISNAD
Inan, Tolga - Kacar, Umit. “Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network”. Balkan Journal of Electrical and Computer Engineering 10/3 (01 Temmuz 2022): 337-346. https://doi.org/10.17694/bajece.1024073.
JAMA
1.Inan T, Kacar U. Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering. 2022;10:337–346.
MLA
Inan, Tolga, ve Umit Kacar. “Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network”. Balkan Journal of Electrical and Computer Engineering, c. 10, sy 3, Temmuz 2022, ss. 337-46, doi:10.17694/bajece.1024073.
Vancouver
1.Tolga Inan, Umit Kacar. Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering. 01 Temmuz 2022;10(3):337-46. doi:10.17694/bajece.1024073

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.Creative Commons Lisans