In the realm of contemporary document processing, the challenge of extracting crucial information from diverse invoices necessitates innovative solutions. This article presents a comprehensive three-step methodology to address the complexity of date extraction from invoices. Leveraging LabelStudio, Python, and OpenCV, we constitute a dataset and train a custom object detection model using Ultralytics YOLOv8. Optical Character Recognition (OCR) provides us to convert the image data to string data that is enable to be processed. Regular expressions refine the extracted text, achieving precise date formats. The developed system significantly enhance the time efficiency, marking a noteworthy advancement in date extraction from invoices.
| Birincil Dil | İngilizce |
|---|---|
| Konular | Görüntü İşleme, Modelleme ve Simülasyon |
| Bölüm | Araştırma Makalesi |
| Yazarlar | |
| Gönderilme Tarihi | 6 Aralık 2023 |
| Kabul Tarihi | 30 Ağustos 2024 |
| Yayımlanma Tarihi | 30 Ağustos 2024 |
| DOI | https://doi.org/10.54569/aair.1401234 |
| IZ | https://izlik.org/JA34SN77ZA |
| Yayımlandığı Sayı | Yıl 2024 Cilt: 4 Sayı: 1 |
Advances in Artificial Intelligence Research is an open access journal which means that the content is freely available without charge to the user or his/her institution. All papers are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which allows users to distribute, remix, adapt, and build upon the material in any medium or format for non-commercial purposes only, and only so long as attribution is given to the creator.
Graphic design @ Özden Işıktaş