Research Article

Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures

Volume: 9 Number: 1 July 31, 2025
TR EN

Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures

Abstract

Real-time multi-object detection and tracking is one of the main challenges in analysing aerial imagery from platforms such as unmanned aerial vehicles (UAVs) and satellite systems. Traditional tracking algorithms are inadequate especially in complex and dynamic environments, which necessitates the development of more powerful and flexible methods. This study proposes a deep learning based solution in line with the objectives of high extraction rate and sufficient accuracy. Within the scope of the study, the YOLOv11 model, which is a single-stage object detection architecture, is integrated with the ByteTrack algorithm for tracking the detected objects. This approach is evaluated in comparison with the two-stage Faster R-CNN architecture (MobileNetV3-large FPN backbone), which is known for its high accuracy in object detection. The training of the models was performed on a comprehensive dataset created by combining the challenging VisDrone and DOTA datasets containing objects of different sizes. The results show that the YOLOv11 model is more suitable for real-time applications due to its high speed inference capability, while the Faster R-CNN model provides more accurate detections despite its higher computational cost. The ByteTrack algorithm used in the tracking phase has increased the tracking accuracy by successfully identifying the detected objects. In this context, the advantages and limitations of both models are evaluated and it is concluded that the choice of model should be made according to the requirements of the targeted application.

Keywords

Supporting Institution

The Scientific and Technological Research Council of Turkey (TÜBİTAK) – 2209-A Research Projects Support Programme for Undergraduate Students (Project No: 1919B012406548)

Project Number

1919B012406548

Ethical Statement

This study was conducted as part of the undergraduate capstone project in the Department of Computer Engineering at Tokat Gaziosmanpaşa University.

References

  1. [1] Simonyan, Karen & Zisserman, Andrew. ( 2014 ). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv 1409.1556.].
  2. [2] Inik, O., İnik, Ö., Öztaş, T., Demir, Y., & Yüksel, A. (2023). Prediction of soil organic matter with deep learning. Arabian Journal for Science and Engineering, 48(8), 10227-10247.
  3. [3] Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25.
  4. [4] İnik, Ö., Uyar, K., & Ülker, E. (2018). Gender classification with a novel convolutional neural network (CNN) model and comparison with other machine learning and deep learning CNN models. Journal Of Industrial Engineering Research, 4(4), 57-63.
  5. [5] Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., ... & Rabinovich, A. (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1-9).
  6. [6] İnik, Ö., & Turan, B. (2018). Classification of animals with different deep learning models. Journal of New Results in Science, 7(1), 9-16.
  7. [7] Liu, Z., Chen, Y., & Gao, Y. (2024). Rotating-YOLO: A novel YOLO model for remote sensing rotating object detection. Image and Vision Computing, 105397.
  8. [8] Wang, T., Ma, Z., Yang, T., & Zou, S. (2023). PETNet: A YOLO-based prior enhanced transformer network for aerial image detection. Neurocomputing, 547, 126384.

Details

Primary Language

English

Subjects

Image Processing, Artificial Intelligence (Other)

Journal Section

Research Article

Early Pub Date

July 27, 2025

Publication Date

July 31, 2025

Submission Date

July 5, 2025

Acceptance Date

July 27, 2025

Published in Issue

Year 2025 Volume: 9 Number: 1

APA
Akyüz, B., Bahadır, M., & İnik, Ö. (2025). Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures. International Journal of Multidisciplinary Studies and Innovative Technologies, 9(1), 162-172. https://izlik.org/JA83DG34BN
AMA
1.Akyüz B, Bahadır M, İnik Ö. Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures. IJMSIT. 2025;9(1):162-172. https://izlik.org/JA83DG34BN
Chicago
Akyüz, Betül, Melih Bahadır, and Özkan İnik. 2025. “Multiple Object Detection and Tracking in Real-Time Aerial Imagery With Deep Learning Architectures”. International Journal of Multidisciplinary Studies and Innovative Technologies 9 (1): 162-72. https://izlik.org/JA83DG34BN.
EndNote
Akyüz B, Bahadır M, İnik Ö (August 1, 2025) Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures. International Journal of Multidisciplinary Studies and Innovative Technologies 9 1 162–172.
IEEE
[1]B. Akyüz, M. Bahadır, and Ö. İnik, “Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures”, IJMSIT, vol. 9, no. 1, pp. 162–172, Aug. 2025, [Online]. Available: https://izlik.org/JA83DG34BN
ISNAD
Akyüz, Betül - Bahadır, Melih - İnik, Özkan. “Multiple Object Detection and Tracking in Real-Time Aerial Imagery With Deep Learning Architectures”. International Journal of Multidisciplinary Studies and Innovative Technologies 9/1 (August 1, 2025): 162-172. https://izlik.org/JA83DG34BN.
JAMA
1.Akyüz B, Bahadır M, İnik Ö. Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures. IJMSIT. 2025;9:162–172.
MLA
Akyüz, Betül, et al. “Multiple Object Detection and Tracking in Real-Time Aerial Imagery With Deep Learning Architectures”. International Journal of Multidisciplinary Studies and Innovative Technologies, vol. 9, no. 1, Aug. 2025, pp. 162-7, https://izlik.org/JA83DG34BN.
Vancouver
1.Betül Akyüz, Melih Bahadır, Özkan İnik. Multiple Object Detection and Tracking in Real-Time Aerial Imagery with Deep Learning Architectures. IJMSIT [Internet]. 2025 Aug. 1;9(1):162-7. Available from: https://izlik.org/JA83DG34BN