Review Article

A Survey on Text-Line Segmentation in Arab Historical Manuscripts

Volume: 7 Number: 1 June 13, 2024
EN

A Survey on Text-Line Segmentation in Arab Historical Manuscripts

Abstract

The segmentation process entails dividing or decomposing the entire document image into segments. This operation serves as a fundamental step in developing any writing or optical character recognition system. However, numerous existing segmentation schemes encounter challenges when dealing with specific script styles, like ancient or historical Arabic writing found in ancient manuscripts, which possesses unique characteristics. These characteristics include inclined text lines, overlapping letters, diacritic marks, decorative elements, variable letter forms, and ligatures (combinations of two or more letters merged to form a single connected shape). Thus, in this paper, we present a thorough survey of the field. The survey is composed of two parts. The first section provides a concise overview of historical Arabic documents. The second, which serves as the primary section, focuses on the crucial step of handwritten document recognition, specifically segmentation. A detailed and systematic overview of various segmentation approaches at different levels for extracting handwritten Arabic text-lines is outlined, followed by a literature study analyzing proposed works in this area.

Keywords

References

  1. Paola Orsatti. Le manuscrit islamique: caract´eristiques mat´erielles et typologie. In Ancient and Medieval Book Materials and Techniques, volume 2, pages 269–331. Biblioteca Apostolica Vaticana, 1993.
  2. Ayman Al-Dmour and Fares Fraij. Segmenting arabic handwritten documents into text lines and words. International journal of Advancements in Computing technology, 6(3):109, 2014.
  3. Islamic medical manuscripts at the national library of medicine. https://www.nlm.nih.gov/hmd/arabic/arabichome.html. Accessed: 2023-03-10.
  4. Bibliothèque nationale de tunisie. http://www.bibliotheque.nat.tn. Accessed: 2023-03-10.
  5. Thibault Lebore. Segmentation d’image application aux documents anciens. Mémoire de Master de recherche, Université de Nante, France, 2007.
  6. Takwa Ben Aïcha Gader and Afef Kacem Echi. Unconstrained handwritten arabic text-lines segmentation based on ar2u-net. In 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pages 349–354. IEEE, 2020.
  7. A Bennasri, A Zahour, and B Taconet. Extraction des lignes d’un texte manuscrit arabe. In Vision interface, volume 99, pages 42–48, 1999.
  8. Alamri Huda, J Sadri, CY Suen, and Nicola Nobile. A novel comprehensive database for arabic off-line handwriting recognition. In Proceedings of 11th International Conference on Frontiers in Handwriting Recognition, ICFHR, volume 8, pages 664–669, 2008.

Details

Primary Language

English

Subjects

Artificial Intelligence (Other)

Journal Section

Review Article

Early Pub Date

May 28, 2024

Publication Date

June 13, 2024

Submission Date

December 20, 2023

Acceptance Date

April 3, 2024

Published in Issue

Year 2024 Volume: 7 Number: 1

APA
Djaghbellou, S., Attıa, A., & Bouzıane, A. (2024). A Survey on Text-Line Segmentation in Arab Historical Manuscripts. International Journal of Informatics and Applied Mathematics, 7(1), 14-32. https://doi.org/10.53508/ijiam.1407236
AMA
1.Djaghbellou S, Attıa A, Bouzıane A. A Survey on Text-Line Segmentation in Arab Historical Manuscripts. IJIAM. 2024;7(1):14-32. doi:10.53508/ijiam.1407236
Chicago
Djaghbellou, Soumia, Abdelouahab Attıa, and Abderraouf Bouzıane. 2024. “A Survey on Text-Line Segmentation in Arab Historical Manuscripts”. International Journal of Informatics and Applied Mathematics 7 (1): 14-32. https://doi.org/10.53508/ijiam.1407236.
EndNote
Djaghbellou S, Attıa A, Bouzıane A (June 1, 2024) A Survey on Text-Line Segmentation in Arab Historical Manuscripts. International Journal of Informatics and Applied Mathematics 7 1 14–32.
IEEE
[1]S. Djaghbellou, A. Attıa, and A. Bouzıane, “A Survey on Text-Line Segmentation in Arab Historical Manuscripts”, IJIAM, vol. 7, no. 1, pp. 14–32, June 2024, doi: 10.53508/ijiam.1407236.
ISNAD
Djaghbellou, Soumia - Attıa, Abdelouahab - Bouzıane, Abderraouf. “A Survey on Text-Line Segmentation in Arab Historical Manuscripts”. International Journal of Informatics and Applied Mathematics 7/1 (June 1, 2024): 14-32. https://doi.org/10.53508/ijiam.1407236.
JAMA
1.Djaghbellou S, Attıa A, Bouzıane A. A Survey on Text-Line Segmentation in Arab Historical Manuscripts. IJIAM. 2024;7:14–32.
MLA
Djaghbellou, Soumia, et al. “A Survey on Text-Line Segmentation in Arab Historical Manuscripts”. International Journal of Informatics and Applied Mathematics, vol. 7, no. 1, June 2024, pp. 14-32, doi:10.53508/ijiam.1407236.
Vancouver
1.Soumia Djaghbellou, Abdelouahab Attıa, Abderraouf Bouzıane. A Survey on Text-Line Segmentation in Arab Historical Manuscripts. IJIAM. 2024 Jun. 1;7(1):14-32. doi:10.53508/ijiam.1407236

Cited By

International Journal of Informatics and Applied Mathematics