Research Article

Using Up-to-Date GAN Methods for Aerial Images

Volume: 15 Number: 1 March 29, 2024
TR EN

Using Up-to-Date GAN Methods for Aerial Images

Abstract

Object detection and segmentation in aerial images is currently a vibrant and significant field of research. The iSAID dataset has been created for object detection in images captured by aerial vehicles. In this study, image semantic segmentation was performed on the iSAID dataset using Generative Adversarial Networks (GANs). The compared GAN methods are CycleGAN, DCLGAN, SimDCL, and SSimDCL. All methods operate on unpaired images. DCLGAN and SimDCL methods are derived by taking inspiration from the CycleGAN method. In these methods, cost functions and network structures vary. This study thoroughly examines the methods, and their similarities and differences are observed. After semantic segmentation is performed, the results are presented using both visual and measurement metrics. Measurement metrics such as FID, KID, PSNR, FSIM, SSIM, and MAE are used. Experimental studies show that SSimDCL and SimDCL methods outperform other methods in iSAID image semantic segmentation. CycleGAN method, on the other hand, is observed to be less successful compared to other methods. The aim of this study is to perform automatic semantic segmentation in aerial images.

Keywords

References

  1. [1] R. S. A. V. K. V. Shrimali, "Current trends in segmentation of medical ultrasound B-mode images: A review," IETE Tech. Rev., cilt 1, no. 817, ss. 26, 2009.
  2. [2] G. Hu and Mageras, "Survey of recent volumetric medical image segmentation techniques," Biomedical Engineering, Vukovar, Croatia: In-Tech, ss. 3216, 2009.
  3. [3] A. A. Taha and A. Hanbury, "Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool," BMC medical imaging, cilt 15, no. 1, ss. 1-28, 2015.
  4. [4] G. Wang, W. Li, M. A. Zuluaga, R. Pratt, P. A. Patel, M. Aertsen et al., "Interactive medical image segmentation using deep learning with image-specific fine tuning," IEEE transactions on medical imaging, cilt 37, no. 7, ss. 1562-1573, 2018.
  5. [5] Z. Gu, J. Cheng, H. Fu, K. Zhou, H. Hao, Y. Zhao et al., "Ce-net: Context encoder network for 2d medical image segmentation," IEEE transactions on medical imaging, cilt 38, no. 10, ss. 2281-2292, 2019.
  6. [6] B. Kayalibay, G. Jensen and P. van der Smagt, "CNN-based segmentation of medical imaging data," arXiv preprint arXiv:1701.03056, 2017.
  7. [7] Y. Xue, T. Xu, H. Zhang, L. R. Long, X. Huang, "SegAN: Adversarial Network with Multi-scale L1 Loss for Medical Image Segmentation," Neuroinformatics, cilt 16, ss. 383–392, 2018.
  8. [8] N. Khosravan, A. Mortazi, M. Wallace, and U. Bagci, "Pan: Projective adversarial network for medical image segmentation," Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, ss. 68–76, 2019.

Details

Primary Language

English

Subjects

Image Processing , Deep Learning , Neural Networks

Journal Section

Research Article

Early Pub Date

March 29, 2024

Publication Date

March 29, 2024

Submission Date

November 5, 2023

Acceptance Date

February 11, 2024

Published in Issue

Year 2024 Volume: 15 Number: 1

IEEE
[1]S. Altun Güven and B. Toptaş, “Using Up-to-Date GAN Methods for Aerial Images”, DUJE, vol. 15, no. 1, pp. 87–97, Mar. 2024, doi: 10.24012/dumf.1386384.

Cited By