Araştırma Makalesi

Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets

Cilt: 9 Sayı: 1 24 Aralık 2025
PDF İndir
TR EN

Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets

Öz

This study focuses on the performance evaluation of cutting-edge object detection models, namely, YOLO12X, Mask R-CNN, RT-DETR-X, and RF-DETR-Large on the Open Images (Multi-Object) and LaSOT (Single-Object) datasets. Current cutting-edge trend applications involve CNN-based and Transformer-based object detection models. CNN-based models can use one-pass (YOLO family) or two-pass (R-CNN family) implementations. One-pass object detection models can be faster but suffer from accuracy compared to the two-pass models. Transformer-based models can use Detection Transformers or Vision Transformers. Transformer-based models are gaining popularity, and their performance surpasses CNN-based models. This study evaluates YOLO12X, Mask R-CNN from CNN-based family, and RT-DETR-X, RF-DETR-Large transformer-based architectures in terms of accuracy and time on the Open Images and the LaSOT datasets. All models are the largest available models and pretrained on COCO dataset. Transformer-based models incorporate special types of self-attention and pose significant improvement both on accuracy and speed. The experimental results demonstrate that attention and transformer-based models perform better than the traditional CNN-based object detectors and YOLO12X is the fastest method with a far margin. On the LaSOT dataset, RT-DETR-X posts 0.8804 IoU, 0.7047 F1-score, 0.6597 mAP@0.5, 28.64 fps whereas YOLO12X achieves 0.8572 IoU, 0.6657 F1-score, 0.5357 mAP@0.5, and 49.78 fps.

Anahtar Kelimeler

Etik Beyan

Since no studies were conducted on animals and humans in this study, ethics committee approval was not obtained.

Kaynakça

  1. Atik, M. E., Duran, Z., & Özgünlük, R. (2022). Comparison of YOLO versions for object detection from aerial images. International Journal of Environment and Geoinformatics, 9(2), 87–93. https://doi.org/10.30897/ijegeo.1010741
  2. Bakır, H., & Bakır, R. (2023). Evaluating the robustness of YOLO object detection algorithm in terms of detecting objects in noisy environment. Journal of Scientific Reports-A, (54), 1–25. https://doi.org/10.59313/jsr-a.1257361
  3. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., & Zagoruyko, S. (2020). End-to-end object detection with transformers. In A. Vedaldi, H. Bischof, T. Brox, & J. M. Frahm (Eds.), Computer Vision – ECCV 2020 (pp. 213–229). Springer. https://doi.org/10.1007/978-3-030-58452-8_13
  4. Chen, W., Luo, J., Zhang, F., & Tian, Z. (2024). A review of object detection: Datasets, performance evaluation, architecture, applications and current trends. Multimedia Tools and Applications, 83(24), 65603–65661. https://doi.org/10.1007/s11042-023-17949-4
  5. Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), (pp. 886–893). IEEE. https://doi.org/10.1109/CVPR.2005.177
  6. Dayıoğlu, M., Eyüboğlu, A. K., & Ünal, R. (2025). Performance analysis of YOLO11 models in PCB defect detection tasks. Kuzey Ege Teknik Bilimler ve Teknoloji Dergisi, 2(1), 33–50.
  7. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale [Paper presentation]. International Conference on Learning Representations (ICLR), Virtual.
  8. Ereken, Ö. F., & Tarhan, Ç. (2025). Modeling objects with artificial intelligence based image processing techniques: Object detection with Mask R-CNN. Academic Platform Journal of Engineering and Smart Systems, 13(1), 17–21. https://doi.org/10.21541/apjess.1542885

Ayrıntılar

Birincil Dil

İngilizce

Konular

Bilgi Sistemleri (Diğer)

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

24 Aralık 2025

Yayımlanma Tarihi

24 Aralık 2025

Gönderilme Tarihi

7 Temmuz 2025

Kabul Tarihi

22 Aralık 2025

Yayımlandığı Sayı

Yıl 2026 Cilt: 9 Sayı: 1

Kaynak Göster

APA
Parlak, C. (2026). Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. Black Sea Journal of Engineering and Science, 9(1), 287-294. https://doi.org/10.34248/bsengineering.1736319
AMA
1.Parlak C. Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. BSJ Eng. Sci. 2026;9(1):287-294. doi:10.34248/bsengineering.1736319
Chicago
Parlak, Cevahir. 2026. “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”. Black Sea Journal of Engineering and Science 9 (1): 287-94. https://doi.org/10.34248/bsengineering.1736319.
EndNote
Parlak C (01 Ocak 2026) Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. Black Sea Journal of Engineering and Science 9 1 287–294.
IEEE
[1]C. Parlak, “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”, BSJ Eng. Sci., c. 9, sy 1, ss. 287–294, Oca. 2026, doi: 10.34248/bsengineering.1736319.
ISNAD
Parlak, Cevahir. “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”. Black Sea Journal of Engineering and Science 9/1 (01 Ocak 2026): 287-294. https://doi.org/10.34248/bsengineering.1736319.
JAMA
1.Parlak C. Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. BSJ Eng. Sci. 2026;9:287–294.
MLA
Parlak, Cevahir. “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”. Black Sea Journal of Engineering and Science, c. 9, sy 1, Ocak 2026, ss. 287-94, doi:10.34248/bsengineering.1736319.
Vancouver
1.Cevahir Parlak. Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. BSJ Eng. Sci. 01 Ocak 2026;9(1):287-94. doi:10.34248/bsengineering.1736319

                           24890