Bounding box regression plays a pivotal role in the majority of object detection algorithms, significantly
influencing the accuracy of object positioning and the regression speed of Convolutional Neural Networks (CNN). In object detection benchmarks, Intersection over Union (IoU) remains the widely adopted metric for evaluation. Traditional IoU-based loss functions often suffer from poor training outcomes and slow convergence, and they fail to account for situations where the predicted bounding box does not entirely capture the object’s mask. This study introduces the Mask-based Intersection over Union (MbIoU) metric for improving bounding box regression in object detection using medical images. The proposed MbIoU metric incorporates the object mask into the bounding box regression process, offering a more precise evaluation of how well the predicted bounding box encapsulates the object. The developed MbIoU metric was tested on the MNIST: HAM10000 dermoscopic skin images dataset, COVID-19 CT dataset, and Brain Tumor dataset and compared to traditional IoU metrics. The results show that MbIoU enhances the prediction by better capturing the object’s contained mask.
Object detection convolutional neural networks bounding box regression intersection over union.
Primary Language | English |
---|---|
Subjects | Deep Learning, Machine Vision |
Journal Section | Articles |
Authors | |
Publication Date | December 31, 2024 |
Submission Date | January 21, 2024 |
Acceptance Date | October 18, 2024 |
Published in Issue | Year 2024 Volume: 16 Issue: 2 |