A Detailed Analysis of Optical Character Recognition Technology
Abstract
In many different fields, there is a high demand for storing information to a computer storage disk from the data available in printed or handwritten documents or images to later re-utilize this information by means of computers. One simple way to store information to a computer system from these printed documents could be first to scan the documents and then store them as image files. But to re-utilize this information, it would very difficult to read or query text or other information from these image files. Therefore a technique to automatically retrieve and store information, in particular text, from image files is needed. Optical character recognition is an active research area that attempts to develop a computer system with the ability to extract and process text from images automatically. The objective of OCR is to achieve modification or conversion of any form of text or text-containing documents such as handwritten text, printed or scanned text images, into an editable digital format for deeper and further processing. Therefore, OCR enables a machine to automatically recognize text in such documents. Some major challenges need to be recognized and handled in order to achieve a successful automation. The font characteristics of the characters in paper documents and quality of images are only some of the recent challenges. Due to these challenges, characters sometimes may not be recognized correctly by computer system. In this paper we investigate OCR in four different ways. First we give a detailed overview of the challenges that might emerge in OCR stages. Second, we review the general phases of an OCR system such as pre-processing, segmentation, normalization, feature extraction, classification and post-processing. Then, we highlight developments and main applications and uses of OCR and finally, a brief OCR history are discussed. Therefore, this discussion provides a very comprehensive review of the state-of-the-art of the field.
Keywords
References
- Optical character recognition by open source OCR tool tesseract: A case study
Details
Primary Language
English
Subjects
Engineering
Journal Section
Conference Paper
Publication Date
December 1, 2016
Submission Date
November 29, 2016
Acceptance Date
December 1, 2016
Published in Issue
Year 2016 Number: Special Issue-1
Cited By
Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
Computational Intelligence and Neuroscience
https://doi.org/10.1155/2021/2676780Dijital Kütüphanelerde Dokümanlardan Bilgi Geri Kazanımı için Kullanılan Güncel Teknolojiler: Derleme Çalışması
Düzce Üniversitesi Bilim ve Teknoloji Dergisi
https://doi.org/10.29130/dubited.796964Deep learning-based NLP data pipeline for EHR-scanned document information extraction
JAMIA Open
https://doi.org/10.1093/jamiaopen/ooac045Evrişimli tekrarlayan sinir ağı ile metin görüntüleri üzerinde karakter tanıma uygulaması gerçekleştirilmesi
Gazi Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi
https://doi.org/10.17341/gazimmfd.866552A comparative evaluation of the wearable augmented reality-based data presentation interface and traditional methods for data entry tasks
International Journal of Industrial Ergonomics
https://doi.org/10.1016/j.ergon.2021.103190Learning-free, divide and conquer text-line extraction algorithm for printed Arabic text with diacritics
Journal of King Saud University - Computer and Information Sciences
https://doi.org/10.1016/j.jksuci.2022.04.021RECOGNITION OF IRREGULARLY SHAPED WORDS BY USING FRACTAL DIMENSION
European Journal of Technic
https://doi.org/10.36222/ejt.459953Digital Image Processing for Character Detection of Captcha Login Internet Banking Image using Matching Template
Journal of Physics: Conference Series
https://doi.org/10.1088/1742-6596/1179/1/012115A Survey of OCR in Arabic Language: Applications, Techniques, and Challenges
Applied Sciences
https://doi.org/10.3390/app13074584A Multimedia Learning for Chinese Character Image Recognition via Human-Computer Interaction Network
Advances in Multimedia
https://doi.org/10.1155/2022/4427091Classifying Promotion Images Using Optical Character Recognition and Naïve Bayes Classifier
Procedia Computer Science
https://doi.org/10.1016/j.procs.2021.01.033Hybrid Optimization And Effectual Classification For High Recognitions In OCR Systems
Journal of The Institution of Engineers (India): Series B
https://doi.org/10.1007/s40031-021-00604-7Machine Translation Quality in Mobile Apps for Text-based Image Translation
Vertimo studijos
https://doi.org/10.15388/VertStud.2021.4Classification of Letter Images from Scanned Invoices Using Cnn
SSRN Electronic Journal
https://doi.org/10.2139/ssrn.4135060Industrial Optical Character Recognition System in Printing Quality Control of Hot-Rolled Coils Identification
Journal of Control, Automation and Electrical Systems
https://doi.org/10.1007/s40313-019-00551-1A Machine Learning and NLP Approach for Analyzing Eligibility Based on Resume and CV
International Journal of Advanced Research in Science, Communication and Technology
https://doi.org/10.48175/IJARSCT-3905Machine-based identification system via optical character recognition
Flexible Services and Manufacturing Journal
https://doi.org/10.1007/s10696-023-09497-8Segmentation-based, omnifont printed Arabic character recognition without font identification
Journal of King Saud University - Computer and Information Sciences
https://doi.org/10.1016/j.jksuci.2020.10.001An Effectual Optical Character Recognition Using Efficient Learning System
SSRN Electronic Journal
https://doi.org/10.2139/ssrn.3358254PACMAN: A Framework for Pulse Oximeter Digit Detection and Reading in a Low-Resource Setting
IEEE Internet of Things Journal
https://doi.org/10.1109/JIOT.2023.3262205Gpu-based and streaming-enabled implementation of pre-processing flow towards enhancing optical character recognition accuracy and efficiency
Cluster Computing
https://doi.org/10.1007/s10586-023-04137-0Handwritten Odia Digit Recognition using Learning Systems: A Comparison of Neural Networks and Support Vector Machine Models
ACM Transactions on Asian and Low-Resource Language Information Processing
https://doi.org/10.1145/3626524OCR-MRD: performance analysis of different optical character recognition engines for medical report digitization
International Journal of Information Technology
https://doi.org/10.1007/s41870-023-01610-2Image Pre-Processing Effect on OCR's Performance for Image Conversion to Braille Unicode
Procedia Computer Science
https://doi.org/10.1016/j.procs.2023.10.599AFA-Net: Adaptive Feature Attention Network in image deblurring and super-resolution for improving license plate recognition
Computer Vision and Image Understanding
https://doi.org/10.1016/j.cviu.2023.103879Applicability of OCR Engines for Text Recognition in Vehicle Number Plates, Receipts and Handwriting
Journal of Circuits, Systems and Computers
https://doi.org/10.1142/S0218126623503218An algorithm of line segmentation and reading order sorting based on adjacent character detection: A post-processing of OCR for digitization of Chinese historical texts
Journal of Cultural Heritage
https://doi.org/10.1016/j.culher.2024.02.001A hybrid convolutional neural network and support vector machine classifier for Amharic character recognition
Neural Computing and Applications
https://doi.org/10.1007/s00521-024-09657-3Positive Online Customer Reviews Significantly Boost Sales for Micro-Businesses
Integrated Journal for Research in Arts and Humanities
https://doi.org/10.55544/ijrah.4.4.14Optical Recognition of the English Alphabet Using Proteinoids
ACS Omega
https://doi.org/10.1021/acsomega.4c06401Exploring sentiment analysis in handwritten and E-text documents using advanced machine learning techniques: a novel approach
Journal of Big Data
https://doi.org/10.1186/s40537-025-01064-2A systematic review of multilingual numeral recognition systems
Artificial Intelligence Review
https://doi.org/10.1007/s10462-025-11105-0Generative artificial intelligence and machine learning methods to screen social media content
PeerJ Computer Science
https://doi.org/10.7717/peerj-cs.2710Dangerous Goods Detection and Warning Approach Based on Image Processing Techniques
Turkish Journal of Science and Technology
https://doi.org/10.55525/tjst.1563258Generative AI for Intelligent Manufacturing Virtual Assistants in the Semiconductor Industry
IEEE Robotics and Automation Letters
https://doi.org/10.1109/LRA.2025.3544506Large Language Models (LLMs) as Graphing Tools for Advanced Chemistry Education and Research
Journal of Chemical Education
https://doi.org/10.1021/acs.jchemed.4c01498Large Language Model Based Intelligent Fault Information Retrieval System for New Energy Vehicles
Applied Sciences
https://doi.org/10.3390/app15074034Community science helps digitize 78 years of fish and habitat data for thousands of lakes in Michigan, USA
Scientific Data
https://doi.org/10.1038/s41597-025-05241-zTowards the digitization of Kurdish handwritten script using deep learning: a comprehensive analysis
International Journal on Document Analysis and Recognition (IJDAR)
https://doi.org/10.1007/s10032-025-00545-7SmartARW: A text-aware smart mobile industrial augmented reality (AR) wiring assembly system
Computers in Industry
https://doi.org/10.1016/j.compind.2025.104379Turkish Optical Character Recognition Under the Lens: A Systematic Review of Language-Specific Challenges, Dataset Scarcity, and Open-Source Limitations
IEEE Access
https://doi.org/10.1109/ACCESS.2025.3614147A Comprehensive Survey of Transformers in Text Recognition: Techniques, Challenges, and Future Directions
ACM Computing Surveys
https://doi.org/10.1145/3771273Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey
ACM Computing Surveys
https://doi.org/10.1145/3768150Image Segmentation and Text Extraction Techniques for Efficient Information Retrieval
INTI Journal
https://doi.org/10.61453/jods.v2025no15Automating the Extraction of Structured Data from Large Newspaper Corpora using Layout Analysis, OCR and Generative AI
Journal on Computing and Cultural Heritage
https://doi.org/10.1145/3800678