A Survey on Text-Line Segmentation in Arab Historical Manuscripts
Abstract
Segmentation divides a document image into segments. It is a fundamental step in building any writing recognition or optical character recognition system. However, many existing segmentation schemes struggle with particular script styles, such as the historical Arabic writing found in ancient manuscripts, which has distinctive characteristics: inclined text lines, overlapping letters, diacritic marks, decorative elements, variable letter forms, and ligatures (two or more letters merged into a single connected shape). This paper therefore presents a survey of the field in two parts. The first part gives a concise overview of historical Arabic documents. The second and main part focuses on segmentation, a crucial step in handwritten document recognition: it gives a detailed, systematic overview of segmentation approaches at different levels for extracting handwritten Arabic text lines, followed by a literature study analyzing the works proposed in this area.
Details
Primary Language
English
Subjects
Artificial Intelligence (Other)
Journal Section
Review Article
Early Pub Date
May 28, 2024
Publication Date
June 13, 2024
Submission Date
December 20, 2023
Acceptance Date
April 3, 2024
Published in Issue
Year 2024 Volume: 7 Number: 1
APA
Djaghbellou, S., Attia, A., & Bouziane, A. (2024). A Survey on Text-Line Segmentation in Arab Historical Manuscripts. International Journal of Informatics and Applied Mathematics, 7(1), 14-32. https://doi.org/10.53508/ijiam.1407236
Cited By
Recent advances in text line segmentation and baseline detection in historical document images: a systematic review
International Journal on Document Analysis and Recognition (IJDAR)
https://doi.org/10.1007/s10032-025-00526-w
A Systematic Literature Review of Deep Learning Methods for Handwritten Text Recognition in Historical Arabic Manuscripts
Engineering, Technology & Applied Science Research
https://doi.org/10.48084/etasr.12123