Research Article

Mask-Based IoU for Bounding Box Regression Using Medical Images

Volume: 16 Number: 2 December 31, 2024
EN

Mask-Based IoU for Bounding Box Regression Using Medical Images

Abstract

Bounding box regression plays a pivotal role in the majority of object detection algorithms, significantly influencing the accuracy of object positioning and the regression speed of Convolutional Neural Networks (CNN). In object detection benchmarks, Intersection over Union (IoU) remains the widely adopted metric for evaluation. Traditional IoU-based loss functions often suffer from poor training outcomes and slow convergence, and they fail to account for situations where the predicted bounding box does not entirely capture the object’s mask. This study introduces the Mask-based Intersection over Union (MbIoU) metric for improving bounding box regression in object detection using medical images. The proposed MbIoU metric incorporates the object mask into the bounding box regression process, offering a more precise evaluation of how well the predicted bounding box encapsulates the object. The developed MbIoU metric was tested on the MNIST: HAM10000 dermoscopic skin images dataset, COVID-19 CT dataset, and Brain Tumor dataset and compared to traditional IoU metrics. The results show that MbIoU enhances the prediction by better capturing the object’s contained mask.

Keywords

References

  1. Brain Tumor Segmentation, Available at: https://github.com/rastislavkopal/brain-tumor-segmentation/tree/main/brain_tumor_data. Accessed on: 2024-09-20.
  2. Cai, Z., Vasconcelos, N., Cascade R-CNN: Delving Into High Quality Object Detection, Proceedings Of The IEEE Conference On Computer Vision And Pattern Recognition (CVPR), 2018.
  3. Cheng, G., Han, J., Zhou, P., Xu, D., Learning rotation-invariant and fisher discriminative convolutional neural networks for object detection, IEEE Transactions On Image Processing, 28(2018), 265–278.
  4. COVID-19 CT Scan Lesion Segmentation Dataset. Available at: https://www.kaggle.com/datasets/maedemaftouni/covid19-ct-scan-lesion-segmentation-dataset. Accessed on: 2024-09-20.
  5. Dai, J., Li, Y., He, K., Sun, J., R-fcn: Object detection via region-based fully convolutional network, Advances In Neural Information Processing Systems, 29(2016).
  6. Dalal, N., Triggs, B., Histograms of oriented gradients for human detection, 2005 IEEE Computer Society Conference On Computer Vision And Pattern Recognition (CVPR’05), 2005.
  7. Girshick, R., Fast r-cnn,Proceedings of the ieee international conference on computer vision, 2(2015), 1440–1448.
  8. Girshick, R., Donahue, J., Darrell, T., Malik, J., Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, Proceedings Of The IEEE Conference On Computer Vision And Pattern Recognition (CVPR), 2014.

Details

Primary Language

English

Subjects

Deep Learning, Machine Vision

Journal Section

Research Article

Publication Date

December 31, 2024

Submission Date

January 21, 2024

Acceptance Date

October 18, 2024

Published in Issue

Year 2024 Volume: 16 Number: 2

APA
Çakar, S., Kotan, M., Öz, C., Sönmez, A. F., Cerezci, F., & Delibaşoğlu, İ. (2024). Mask-Based IoU for Bounding Box Regression Using Medical Images. Turkish Journal of Mathematics and Computer Science, 16(2), 325-332. https://doi.org/10.47000/tjmcs.1423292
AMA
1.Çakar S, Kotan M, Öz C, Sönmez AF, Cerezci F, Delibaşoğlu İ. Mask-Based IoU for Bounding Box Regression Using Medical Images. TJMCS. 2024;16(2):325-332. doi:10.47000/tjmcs.1423292
Chicago
Çakar, Serap, Muhammed Kotan, Cemil Öz, Ahmet Furkan Sönmez, Feyza Cerezci, and İbrahim Delibaşoğlu. 2024. “Mask-Based IoU for Bounding Box Regression Using Medical Images”. Turkish Journal of Mathematics and Computer Science 16 (2): 325-32. https://doi.org/10.47000/tjmcs.1423292.
EndNote
Çakar S, Kotan M, Öz C, Sönmez AF, Cerezci F, Delibaşoğlu İ (December 1, 2024) Mask-Based IoU for Bounding Box Regression Using Medical Images. Turkish Journal of Mathematics and Computer Science 16 2 325–332.
IEEE
[1]S. Çakar, M. Kotan, C. Öz, A. F. Sönmez, F. Cerezci, and İ. Delibaşoğlu, “Mask-Based IoU for Bounding Box Regression Using Medical Images”, TJMCS, vol. 16, no. 2, pp. 325–332, Dec. 2024, doi: 10.47000/tjmcs.1423292.
ISNAD
Çakar, Serap - Kotan, Muhammed - Öz, Cemil - Sönmez, Ahmet Furkan - Cerezci, Feyza - Delibaşoğlu, İbrahim. “Mask-Based IoU for Bounding Box Regression Using Medical Images”. Turkish Journal of Mathematics and Computer Science 16/2 (December 1, 2024): 325-332. https://doi.org/10.47000/tjmcs.1423292.
JAMA
1.Çakar S, Kotan M, Öz C, Sönmez AF, Cerezci F, Delibaşoğlu İ. Mask-Based IoU for Bounding Box Regression Using Medical Images. TJMCS. 2024;16:325–332.
MLA
Çakar, Serap, et al. “Mask-Based IoU for Bounding Box Regression Using Medical Images”. Turkish Journal of Mathematics and Computer Science, vol. 16, no. 2, Dec. 2024, pp. 325-32, doi:10.47000/tjmcs.1423292.
Vancouver
1.Serap Çakar, Muhammed Kotan, Cemil Öz, Ahmet Furkan Sönmez, Feyza Cerezci, İbrahim Delibaşoğlu. Mask-Based IoU for Bounding Box Regression Using Medical Images. TJMCS. 2024 Dec. 1;16(2):325-32. doi:10.47000/tjmcs.1423292