Research Article

Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets

Volume: 9 Number: 1 December 24, 2025
TR EN

Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets

Abstract

This study focuses on the performance evaluation of cutting-edge object detection models, namely, YOLO12X, Mask R-CNN, RT-DETR-X, and RF-DETR-Large on the Open Images (Multi-Object) and LaSOT (Single-Object) datasets. Current cutting-edge trend applications involve CNN-based and Transformer-based object detection models. CNN-based models can use one-pass (YOLO family) or two-pass (R-CNN family) implementations. One-pass object detection models can be faster but suffer from accuracy compared to the two-pass models. Transformer-based models can use Detection Transformers or Vision Transformers. Transformer-based models are gaining popularity, and their performance surpasses CNN-based models. This study evaluates YOLO12X, Mask R-CNN from CNN-based family, and RT-DETR-X, RF-DETR-Large transformer-based architectures in terms of accuracy and time on the Open Images and the LaSOT datasets. All models are the largest available models and pretrained on COCO dataset. Transformer-based models incorporate special types of self-attention and pose significant improvement both on accuracy and speed. The experimental results demonstrate that attention and transformer-based models perform better than the traditional CNN-based object detectors and YOLO12X is the fastest method with a far margin. On the LaSOT dataset, RT-DETR-X posts 0.8804 IoU, 0.7047 F1-score, 0.6597 mAP@0.5, 28.64 fps whereas YOLO12X achieves 0.8572 IoU, 0.6657 F1-score, 0.5357 mAP@0.5, and 49.78 fps.

Keywords

Ethical Statement

Since no studies were conducted on animals and humans in this study, ethics committee approval was not obtained.

References

  1. Atik, M. E., Duran, Z., & Özgünlük, R. (2022). Comparison of YOLO versions for object detection from aerial images. International Journal of Environment and Geoinformatics, 9(2), 87–93. https://doi.org/10.30897/ijegeo.1010741
  2. Bakır, H., & Bakır, R. (2023). Evaluating the robustness of YOLO object detection algorithm in terms of detecting objects in noisy environment. Journal of Scientific Reports-A, (54), 1–25. https://doi.org/10.59313/jsr-a.1257361
  3. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., & Zagoruyko, S. (2020). End-to-end object detection with transformers. In A. Vedaldi, H. Bischof, T. Brox, & J. M. Frahm (Eds.), Computer Vision – ECCV 2020 (pp. 213–229). Springer. https://doi.org/10.1007/978-3-030-58452-8_13
  4. Chen, W., Luo, J., Zhang, F., & Tian, Z. (2024). A review of object detection: Datasets, performance evaluation, architecture, applications and current trends. Multimedia Tools and Applications, 83(24), 65603–65661. https://doi.org/10.1007/s11042-023-17949-4
  5. Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), (pp. 886–893). IEEE. https://doi.org/10.1109/CVPR.2005.177
  6. Dayıoğlu, M., Eyüboğlu, A. K., & Ünal, R. (2025). Performance analysis of YOLO11 models in PCB defect detection tasks. Kuzey Ege Teknik Bilimler ve Teknoloji Dergisi, 2(1), 33–50.
  7. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale [Paper presentation]. International Conference on Learning Representations (ICLR), Virtual.
  8. Ereken, Ö. F., & Tarhan, Ç. (2025). Modeling objects with artificial intelligence based image processing techniques: Object detection with Mask R-CNN. Academic Platform Journal of Engineering and Smart Systems, 13(1), 17–21. https://doi.org/10.21541/apjess.1542885

Details

Primary Language

English

Subjects

Information Systems (Other)

Journal Section

Research Article

Early Pub Date

December 24, 2025

Publication Date

December 24, 2025

Submission Date

July 7, 2025

Acceptance Date

December 22, 2025

Published in Issue

Year 2026 Volume: 9 Number: 1

APA
Parlak, C. (2026). Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. Black Sea Journal of Engineering and Science, 9(1), 287-294. https://doi.org/10.34248/bsengineering.1736319
AMA
1.Parlak C. Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. BSJ Eng. Sci. 2026;9(1):287-294. doi:10.34248/bsengineering.1736319
Chicago
Parlak, Cevahir. 2026. “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”. Black Sea Journal of Engineering and Science 9 (1): 287-94. https://doi.org/10.34248/bsengineering.1736319.
EndNote
Parlak C (January 1, 2026) Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. Black Sea Journal of Engineering and Science 9 1 287–294.
IEEE
[1]C. Parlak, “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”, BSJ Eng. Sci., vol. 9, no. 1, pp. 287–294, Jan. 2026, doi: 10.34248/bsengineering.1736319.
ISNAD
Parlak, Cevahir. “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”. Black Sea Journal of Engineering and Science 9/1 (January 1, 2026): 287-294. https://doi.org/10.34248/bsengineering.1736319.
JAMA
1.Parlak C. Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. BSJ Eng. Sci. 2026;9:287–294.
MLA
Parlak, Cevahir. “Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets”. Black Sea Journal of Engineering and Science, vol. 9, no. 1, Jan. 2026, pp. 287-94, doi:10.34248/bsengineering.1736319.
Vancouver
1.Cevahir Parlak. Evaluation of Cutting-Edge Object Detection Architectures on Multi-Object and Single-Object Datasets. BSJ Eng. Sci. 2026 Jan. 1;9(1):287-94. doi:10.34248/bsengineering.1736319

                            24890