Research Article

Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network

Volume: 10 Number: 3 July 30, 2022
EN

Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network

Abstract

Semantic segmentation is a fundamental problem for computer vision. On the other hand, for studies in the field of biometrics, semantic segmentation is gaining more importance. Many successful biometric recognition systems require a high- performance semantic segmentation algorithm. In this study, we present an effective ear segmentation technique in natural images. A convolutional neural network is trained for pixel-based ear segmentation. DeepLab v3+ network structure, with ResNet-18 as the backbone and Tversky lost function layer as the last layer, has been trained with natural and uncontrolled images. We perform the proposed network training using only the 750 images in the Annotated Web Ears (AWE) training set. The corresponding tests are performed on the AWE Test Set, University of Ljubljana Test Set, and the Collection A of In-The-Wild dataset. For the Annotated Web Ears (AWE) dataset, intersection over union (IoU) is measured as 86.3% for the AWE database. To the best of our knowledge, this is the highest performance achieved among the algorithms tested on the AWE test set.

Keywords

References

  1. [1] A. Abaza, A. Ross, C. Hebert, M. A. F. Harrison, and M. S.Nixon, “A survey on ear biometrics,” ACM Computing Surveys,vol. 45, no. 2, pp. 1–35, Feb. 2013, number: 2 Reporter: ACMComputing Surveys. [Online]. Available: http://dl.acm.org/citation.cfm?doid=2431211.2431221
  2. [2] A. Pflug and C. Busch, “Ear biometrics: a survey of detection,feature extraction and recognition methods,” IET Biometrics, vol. 1,no. 2, pp. 114–129, Jun. 2012, number: 2 Reporter: IET Biometrics.[Online]. Available: https://digital-library.theiet.org/content/journals/10.1049/iet-bmt.2011.0003
  3. [3] Z. Emersic, D. Stepec, V. Struc, P. Peer, A. George, A. Ahmad, E. Omar,T. E. Boult, R. Safdaii, Y. Zhou, S. Zafeiriou, D. Yaman, F. I. Eyiokur,and H. K. Ekenel, “The unconstrained ear recognition challenge,” in2017 IEEE International Joint Conference on Biometrics (IJCB), Oct.2017, pp. 715–724, meeting Name: 2017 IEEE International JointConference on Biometrics (IJCB) Reporter: 2017 IEEE InternationalJoint Conference on Biometrics (IJCB) ISSN: 2474-9699.
  4. [4] Z. Emersic, A. K. S. V, B. S. Harish, W. Gutfeter, J. N. Khiarak,A. Pacut, E. Hansley, M. P. Segundo, S. Sarkar, H. J. Park, G. P. Nam, I.-J. Kim, S. G. Sangodkar, U. Kacar, M. Kirci, L. Yuan, J. Yuan, H. Zhao,F. Lu, J. Mao, X. Zhang, D. Yaman, F. I. Eyiokur, K. B. Özler, H. K.Ekenel, D. P. Chowdhury, S. Bakshi, P. K. Sa, B. Majhi, P. Peer, andV. Štruc, “The Unconstrained Ear Recognition Challenge 2019,” in 2019International Conference on Biometrics (ICB), 2019, pp. 1–15.
  5. [5] Z. Emersic, J. Krizaj, V. Struc, and P. Peer, “Deep Ear RecognitionPipeline,” in Recent Advances in Computer Vision: Theories andApplications, ser. Studies in Computational Intelligence, M. Hassaballahand K. M. Hosny, Eds. Cham: Springer International Publishing,2019, pp. 333–362, reporter: Recent Advances in Computer Vision:Theories and Applications. [Online]. Available: https://doi.org/10.1007/978-3-030-03000-1 14
  6. [6] Z. Zou, Z. Shi, Y. Guo, and J. Ye, “Object Detection in 20Years: A Survey,” arXiv:1905.05055 [cs], May 2019, reporter:arXiv:1905.05055 [cs] arXiv: 1905.05055. [Online]. Available: http://arxiv.org/abs/1905.05055
  7. [7] A. Paszke, A. Chaurasia, S. Kim, and E. Culurciello, “ENet: A DeepNeural Network Architecture for Real-Time Semantic Segmentation,”arXiv:1606.02147 [cs], Jun. 2016, reporter: arXiv:1606.02147 [cs]arXiv: 1606.02147. [Online]. Available: http://arxiv.org/abs/1606.02147
  8. [8] L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L.Yuille, “Semantic Image Segmentation with Deep ConvolutionalNets and Fully Connected CRFs,” arXiv:1412.7062 [cs], Jun. 2016,reporter: arXiv:1412.7062 [cs] arXiv: 1412.7062. [Online]. Available:http://arxiv.org/abs/1412.7062

Details

Primary Language

English

Subjects

Artificial Intelligence

Journal Section

Research Article

Publication Date

July 30, 2022

Submission Date

November 15, 2021

Acceptance Date

July 18, 2022

Published in Issue

Year 2022 Volume: 10 Number: 3

APA
Inan, T., & Kacar, U. (2022). Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering, 10(3), 337-346. https://doi.org/10.17694/bajece.1024073
AMA
1.Inan T, Kacar U. Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering. 2022;10(3):337-346. doi:10.17694/bajece.1024073
Chicago
Inan, Tolga, and Umit Kacar. 2022. “Ear Semantic Segmentation in Natural Images With Tversky Loss Function Supported DeepLabv3+ Convolutional Neural Network”. Balkan Journal of Electrical and Computer Engineering 10 (3): 337-46. https://doi.org/10.17694/bajece.1024073.
EndNote
Inan T, Kacar U (July 1, 2022) Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering 10 3 337–346.
IEEE
[1]T. Inan and U. Kacar, “Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network”, Balkan Journal of Electrical and Computer Engineering, vol. 10, no. 3, pp. 337–346, July 2022, doi: 10.17694/bajece.1024073.
ISNAD
Inan, Tolga - Kacar, Umit. “Ear Semantic Segmentation in Natural Images With Tversky Loss Function Supported DeepLabv3+ Convolutional Neural Network”. Balkan Journal of Electrical and Computer Engineering 10/3 (July 1, 2022): 337-346. https://doi.org/10.17694/bajece.1024073.
JAMA
1.Inan T, Kacar U. Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering. 2022;10:337–346.
MLA
Inan, Tolga, and Umit Kacar. “Ear Semantic Segmentation in Natural Images With Tversky Loss Function Supported DeepLabv3+ Convolutional Neural Network”. Balkan Journal of Electrical and Computer Engineering, vol. 10, no. 3, July 2022, pp. 337-46, doi:10.17694/bajece.1024073.
Vancouver
1.Tolga Inan, Umit Kacar. Ear semantic segmentation in natural images with Tversky loss function supported DeepLabv3+ convolutional neural network. Balkan Journal of Electrical and Computer Engineering. 2022 Jul. 1;10(3):337-46. doi:10.17694/bajece.1024073

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.Creative Commons Lisansı