Research Article
BibTex RIS Cite

Crowd Detection: Leveraging YOLO for Human Recognition

Year 2025, Volume: 9 Issue: 3, 571 - 577, 01.07.2025
https://doi.org/10.31127/tuje.1627839

Abstract

Human detection in crowded environments is essential for applications such as surveillance, autonomous navigation, and crowd management. This study examines the performance of various YOLO (You Only Look Once) models in detecting humans. We combined four public human detection datasets to create a comprehensive dataset for crowd detection. Experiments were conducted on YOLOv5, YOLOv8, and YOLOv11 models, employing different architectures and model sizes. Performance was evaluated using mean Average Precision (mAP) at Intersection over Union (IoU) thresholds of 50% (mAP@50) and across 50-95% (mAP@50-95). The results indicate that the YOLOv8m model achieved the highest mAP@50 of 0.944 and mAP@50-95 of 0.697, surpassing larger models such as YOLOv11x, which attained 0.90 and 0.632 respectively. Additionally, other YOLOv8 variants demonstrated superior or comparable performance to their YOLOv5 and YOLOv11 counterparts. These findings highlight the effectiveness of YOLOv8’s optimized structures in delivering accurate and efficient human detection in high-density settings.

References

  • Li, T., Chang, H., Wang, M., Ni, B., Hong, R., & Yan, S. (2014). Crowded scene analysis: A survey. IEEE Transactions on Circuits and Systems for Video Technology, 25(3), 367–386. https://doi.org/10.1109/TCSVT.2014.2358029.
  • Redmon, J. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. https://doi.org/10.1109/ISCAS.2008.4542023.
  • Girshick, R., Donahue, J., Darrell, T., Berkeley, U., & Malik, J. (2014). R-CNN: Region-based convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2–9).
  • Zhao, L., & Li, S. (2020). Object detection algorithm based on improved YOLOv3. Electronics, 9(3), 537. https://doi.org/10.3390/electronics9030537.
  • Bochkovskiy, A., Wang, C.-Y., & Liao, H.-Y. M. (2020). YOLOv4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934. https://doi.org/10.48550/arXiv.2004.10934
  • Zhan, W., Sun, C., Wang, M., She, J., Zhang, Y., Zhang, Z., & Sun, Y. (2022). An improved YOLOv5 real-time detection method for small objects captured by UAV. Soft Computing, 26, 361–373. https://doi.org/10.1007/s00500-021-06407-8
  • Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., Nie, W., et al. (2022). YOLOv6: A single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976.https://doi.org/10.48550/arXiv.2209.02976
  • Wang, C.-Y., Bochkovskiy, A., & Liao, H.-Y. M. (2023). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 7464–7475). https://doi.org/10.1109/CVPR52729.2023.00721.
  • Wang, A., Chen, H., Liu, L., Chen, K., Lin, Z., Han, J., & Ding, G. (2024). YOLOv10: Real-time end-to-end object detection. arXiv preprint arXiv:2405.14458. https://doi.org/10.48550/arXiv.2405.14458.
  • Gülgün, O. D., & Erol, H. (2020). Classification performance comparisons of deep learning models in pneumonia diagnosis using chest X-ray images. Turkish Journal of Engineering, 4(3), 129–141. https://doi.org/10.31127/tuje.652358.
  • Aydın, V. A. (2024). Comparison of CNN-based methods for yoga pose classification. Turkish Journal of Engineering, 8(1), 65–75. https://doi.org/10.31127/tuje.1275826
  • Polater, S. N., & Sevli, O. (2024). Deep learning based classification for Alzheimer’s disease detection using MRI images. Turkish Journal of Engineering, 8(4), 729–740. https://doi.org/10.31127/tuje.1434866.
  • Fu, M., Xu, P., Li, X., Liu, Q., Ye, M., & Zhu, C. (2015). Fast crowd density estimation with convolutional neural networks. Engineering Applications of Artificial Intelligence, 43, 81–88. https://doi.org/10.1016/j.engappai.2015.04.006.
  • Oghaz, M. M., Khadka, A. R., Argyriou, V., & Remagnino, P. (2019). Content-aware density map for crowd counting and density estimation. arXiv preprint arXiv:1906.07258. https://doi.org/10.48550/arXiv.1906.07258.
  • Wang, C., Zhang, H., Yang, L., Liu, S., & Cao, X. (2015). Deep people counting in extremely dense crowds. In Proceedings of the 23rd ACM International Conference on Multimedia (pp. 1299–1302). https://doi.org/10.1145/2733373.280633.
  • Alhawsawi, A. N., Khan, S. D., & Rehman, F. U. (2024). Crowd counting in diverse environments using a deep routing mechanism informed by crowd density levels. Information, 15(5), 275. https://doi.org/10.3390/info15050275.
  • Yang, G., & Zhu, D. (2023). Survey on algorithms of people counting in dense crowd and crowd density estimation. Multimedia Tools and Applications, 82(9), 13637–13648. https://doi.org/10.1007/s11042-022-13957-y.
  • Yao, H., Han, K., Wan, W., & Hou, L. (2017). Deep spatial regression model for image crowd counting. arXiv preprint arXiv:1710.09757. https://doi.org/10.48550/arXiv.1710.09757. Hao, Y., Du, H., Mao, M., Liu, Y., & Fan, J. (2023). A survey on regression-based crowd counting techniques. Information Technology and Control, 52(3), 693–712. https://doi.org/10.5755/j01.itc.52.3.33701.
  • Lee, H., Lee, K., Kang, J., & Sohn, K. (2024). Training a regression-based model for crowd counting in transit cars using ranked image pairs and triplets. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3355442.
  • Zhang, L., Lin, L., Liang, X., & He, K. (2016). Is Faster R-CNN doing well for pedestrian detection? In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14 (pp. 443–457). Springer. https://doi.org/10.1007/978-3-319-46475-6_28.
  • Sindagi, V. A., & Patel, V. M. (2018). A survey of recent advances in CNN-based single image crowd counting and density estimation. Pattern Recognition Letters, 107, 3–16. https://doi.org/10.1016/j.patrec.2017.07.007.
  • Ali, M. A., Hussain, A. J., & Sadiq, A. T. (2022). Detection and count of human bodies in a crowd scene based on enhancement features by using the YOLOv5 algorithm. Iraqi Journal of Computers, Communications, Control and Systems Engineering, 22(2), 125–134. https://doi.org/10.33103/uot.ijccce.22.2.11.
  • Elshahawy, M., Aseeri, A. O., El-Sappagh, S., Soliman, H., Elmogy, M., & Abu-Elkheir, M. (2022). Identification and classification of crowd activities. CMC-Computers, Materials & Continua, 72(1), 815–832. https://doi.org/10.32604/cmc.2022.023852.
  • People detection dataset. Retrieved December 24, 2024, from https://universe.roboflow.com/hcl-ca18b/people-detection-yrmsh
  • People dataset. Retrieved December 24, 2024, from https://universe.roboflow.com/capstone-ssbpj/people-fycjl
  • Solar system dataset. Retrieved December 24, 2024, from https://universe.roboflow.com/human-dataset-v2/solar-system
  • People detection dataset. Retrieved December 24, 2024, from https://universe.roboflow.com/jmedel/people-detection-f0fgt
  • People detection combined dataset. Retrieved December 24, 2024, from https://app.roboflow.com/crowd-counting-bsg6a/people-detection-combined-dataset
There are 28 citations in total.

Details

Primary Language English
Subjects Software Engineering (Other)
Journal Section Articles
Authors

Gülsüm Yiğit 0000-0001-7010-169X

Publication Date July 1, 2025
Submission Date January 31, 2025
Acceptance Date April 7, 2025
Published in Issue Year 2025 Volume: 9 Issue: 3

Cite

APA Yiğit, G. (2025). Crowd Detection: Leveraging YOLO for Human Recognition. Turkish Journal of Engineering, 9(3), 571-577. https://doi.org/10.31127/tuje.1627839
AMA Yiğit G. Crowd Detection: Leveraging YOLO for Human Recognition. TUJE. July 2025;9(3):571-577. doi:10.31127/tuje.1627839
Chicago Yiğit, Gülsüm. “Crowd Detection: Leveraging YOLO for Human Recognition”. Turkish Journal of Engineering 9, no. 3 (July 2025): 571-77. https://doi.org/10.31127/tuje.1627839.
EndNote Yiğit G (July 1, 2025) Crowd Detection: Leveraging YOLO for Human Recognition. Turkish Journal of Engineering 9 3 571–577.
IEEE G. Yiğit, “Crowd Detection: Leveraging YOLO for Human Recognition”, TUJE, vol. 9, no. 3, pp. 571–577, 2025, doi: 10.31127/tuje.1627839.
ISNAD Yiğit, Gülsüm. “Crowd Detection: Leveraging YOLO for Human Recognition”. Turkish Journal of Engineering 9/3 (July2025), 571-577. https://doi.org/10.31127/tuje.1627839.
JAMA Yiğit G. Crowd Detection: Leveraging YOLO for Human Recognition. TUJE. 2025;9:571–577.
MLA Yiğit, Gülsüm. “Crowd Detection: Leveraging YOLO for Human Recognition”. Turkish Journal of Engineering, vol. 9, no. 3, 2025, pp. 571-7, doi:10.31127/tuje.1627839.
Vancouver Yiğit G. Crowd Detection: Leveraging YOLO for Human Recognition. TUJE. 2025;9(3):571-7.
Flag Counter