Research Article
BibTex RIS Cite

Deep Learning-Based Classification of UAV Orthophotos Using MIDNet Architecture

Year 2026, Volume: 8 Issue: 1 , 15 - 27 , 30.03.2026
https://izlik.org/JA49RX58YD

Abstract

Photogrammetric methods have advanced significantly, enabling progress in cartography, construction, agriculture, and natural disaster monitoring. The integration of Structure from Motion (SfM) and orthophoto mapping has facilitated the generation of high-resolution, error-corrected images for various geospatial analyses. However, traditional deep learning-based Convolutional Neural Networks (CNNs) for orthophoto classification face challenges such as high computational costs, limited multiscale feature extraction, and suboptimal accuracy in complex landscapes. To address these limitations, this study introduces Multiscale Inception Depthwise Network (MIDNet), a novel CNN-based architecture designed for efficient and precise classification of UAV-derived high-resolution orthophotos. MIDNet leverages inception modules for multiscale feature extraction and depthwise separable convolutions to enhance computational efficiency without sacrificing performance. Experimental validation conducted on the generated reference dataset demonstrates that MIDNet outperforms the compared deep learning models, achieving an overall accuracy of 96.97%, an average accuracy of 95.96% and a kappa coefficient of 96.29%, surpassing DenseNet121 (OA: 96.32%, AA: 95.47%, Kappa: 95.50%) and InceptionV3 (OA: 96.60%, AA: 94.05%, Kappa: 95.85%), while maintaining the smallest model size (4.05 MB) and fastest testing time (8 seconds). These results underscore MIDNet’s superior classification accuracy, lightweight design, and suitability for resource-constrained environments, making it a compelling advancement in orthophoto classification techniques.

Ethical Statement

This article does not contain any studies with human participants or animals performed by any of the authors

References

  • Agarap, A. F. (2018). Deep Learning using Rectified Linear Units (ReLU). 1, 2–8. http://arxiv.org/abs/1803.08375
  • Aslan, İ., & Polat, N. (2024). Deep learning-based classification of mature and immature lavender plants using UAV orthophotos and a hybrid CNN approach. Earth Science Informatics, 17(2), 1713–1727. https://doi.org/10.1007/s12145-023-01200-7
  • Buyukdemircioglu, M., Can, R., Kocaman, S., & Kada, M. (2022). Deep Learning Based Building Footprint Extraction From Very High Resolution True Orthophotos and Ndsm. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 5(2), 211–218. https://doi.org/10.5194/isprs-annals-V-2-2022-211-2022
  • Carrio, A., Sampedro, C., Rodriguez-Ramos, A., & Campoy, P. (2017). A review of deep learning methods and applications for unmanned aerial vehicles. Journal of Sensors, 2017. https://doi.org/10.1155/2017/3296874
  • Chen, H., Engkvist, O., Wang, Y., Olivecrona, M., & Blaschke, T. (2018). The rise of deep learning in drug discovery. Drug Discovery Today, 23(6), 1241–1250. https://doi.org/10.1016/j.drudis.2018.01.039
  • Ekmekji, A. (Stanford U. (2016). Convolutional Neural Networks for Age and Gender Classificatio. Research Paper.
  • Fırat, H., & Hanbay, D. (2023). Comparison of 3D CNN based deep learning architectures using hyperspectral images. Journal of the Faculty of Engineering and Architecture of Gazi University, 38(1), 521–534. https://doi.org/10.17341/gazimmfd.977688
  • Firat, H., Çiğ, H., Güllüoğlu, M. T., Asker, M. E., & Hanbay, D. (2023). Multiscale Feature Fusion for Hyperspectral Image Classification Using Hybrid 3D-2D Depthwise Separable Convolution Networks. Traitement Du Signal, 40(5), 1921–1939. https://doi.org/10.18280/ts.400512
  • He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2016-Decem, 770–778. https://doi.org/10.1109/CVPR.2016.90
  • Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., & Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. http://arxiv.org/abs/1704.04861
  • Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, 2017-Janua, 2261–2269. https://doi.org/10.1109/CVPR.2017.243
  • Işik, G., & Artuner, H. (2016). Radyo Sinyallerinin Derin Öǧrenme Sinir Aǧlari ile Taninmasi. 2016 24th Signal Processing and Communication Application Conference, SIU 2016 - Proceedings, 837–840. https://doi.org/10.1109/SIU.2016.7495870
  • Jaud, M., Grasso, F., Le Dantec, N., Verney, R., Delacourt, C., Ammann, J., Deloffre, J., & Grandjean, P. (2016). Potential of UAVs for monitoring mudflat morphodynamics (Application to the Sein e Estuary, France). ISPRS International Journal of Geo-Information, 5(4). https://doi.org/10.3390/ijgi5040050
  • Krichen, M. (2023). Convolutional Neural Networks: A Survey. Computers, 12(8), 1–41. https://doi.org/10.3390/computers12080151
  • Krizhevsky, A. (2014). One weird trick for parallelizing convolutional neural networks. http://arxiv.org/abs/1404.5997
  • Manakitsa, N., Maraslidis, G. S., Moysis, L., & Fragulis, G. F. (2024). A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision. Technologies, 12(2). https://doi.org/10.3390/technologies12020015
  • Mittal, P., Singh, R., & Sharma, A. (2020). Deep learning-based object detection in low-altitude UAV datasets: A survey. Image and Vision Computing, 104, 104046. https://doi.org/10.1016/j.imavis.2020.104046
  • Osco, L. P., Marcato Junior, J., Marques Ramos, A. P., de Castro Jorge, L. A., Fatholahi, S. N., de Andrade Silva, J., Matsubara, E. T., Pistori, H., Gonçalves, W. N., & Li, J. (2021). A review on deep learning in UAV remote sensing. International Journal of Applied Earth Observation and Geoinformation, 102. https://doi.org/10.1016/j.jag.2021.102456
  • Park, J., Cho, Y. K., & Kim, S. (2022). Deep learning-based UAV image segmentation and inpainting for generating vehicle-free orthomosaic. International Journal of Applied Earth Observation and Geoinformation, 115(November), 103111. https://doi.org/10.1016/j.jag.2022.103111
  • Qiu, Z., Bai, H., & Chen, T. (2023). Special Vehicle Detection from UAV Perspective via YOLO-GNS Based Deep Learning Network. Drones, 7(2). https://doi.org/10.3390/drones7020117
  • Radovic, M., Adarkwa, O., & Wang, Q. (2017). Object recognition in aerial images using convolutional neural networks. Journal of Imaging, 3(2). https://doi.org/10.3390/jimaging3020021
  • Rakshit, H., & Bagheri Zadeh, P. (2024). A New Approach to Classify Drones Using a Deep Convolutional Neural Network. Drones, 8(7). https://doi.org/10.3390/drones8070319
  • Safonova, A., Tabik, S., Alcaraz-Segura, D., Rubtsov, A., Maglinets, Y., & Herrera, F. (2019). Detection of fir trees (Abies sibirica) damaged by the bark beetle in unmanned aerial vehicle images with deep learning. Remote Sensing, 11(6). https://doi.org/10.3390/rs11060643
  • Saxena, A. (2022). An Introduction to Convolutional Neural Networks. International Journal for Research in Applied Science and Engineering Technology, 10(12), 943–947. https://doi.org/10.22214/ijraset.2022.47789
  • Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, 1–14.
  • Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., & Wojna, Z. (2016). Rethinking the Inception Architecture for Computer Vision. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2016-Decem, 2818–2826. https://doi.org/10.1109/CVPR.2016.308
  • Taye, M. M. (2023). Theoretical Understanding of Convolutional Neural Network: Concepts, Architectures, Applications, Future Directions. Computation, 11(3). https://doi.org/10.3390/computation11030052
  • Vargas, Rocio, Mosavi, Amir, & Ruiz, R. (2017). Deep Learning : a Review Deep Learning : a Review. Advances in Intelligent Systems and Computing, July.
  • Yılmaz, H. M., Mutluoğlu, Ö., Ulvi, A., Yaman, A., & Bilgilioğlu, S. S. (2018). İnsansız Hava Aracı ile Ortofoto Üretimi ve Aksaray Üniversitesi Kampüsü Örneği Created Tree Dimensional Model of Aksaray University Campus With Unmanned Aerial Vehicle. 3(2), 129–136.
  • Zhao, X., Wang, L., Zhang, Y., Han, X., Deveci, M., & Parmar, M. (2024). A review of convolutional neural networks in computer vision. In Artificial Intelligence Review (Vol. 57, Issue 4). Springer Netherlands. https://doi.org/10.1007/s10462-024-10721-6
There are 30 citations in total.

Details

Primary Language English
Subjects Photogrammetry and Remote Sensing
Journal Section Research Article
Authors

İlyas Aslan 0000-0003-4388-6633

Nizar Polat 0000-0002-6061-7796

Submission Date February 8, 2026
Acceptance Date March 17, 2026
Publication Date March 30, 2026
IZ https://izlik.org/JA49RX58YD
Published in Issue Year 2026 Volume: 8 Issue: 1

Cite

APA Aslan, İ., & Polat, N. (2026). Deep Learning-Based Classification of UAV Orthophotos Using MIDNet Architecture. Turkish Journal of Applied Geoinformation Sciences, 8(1), 15-27. https://izlik.org/JA49RX58YD