Araştırma Makalesi

Augmenting parametric data synthesis with 3D simulation for OCR on Old Turkic runiform inscriptions: A case study of the Kül Tegin inscription

Cilt: 8 Sayı: 2 21 Temmuz 2024
PDF İndir
TR EN

Augmenting parametric data synthesis with 3D simulation for OCR on Old Turkic runiform inscriptions: A case study of the Kül Tegin inscription

Öz

Optical character recognition for historical scripts like Old Turkic runiform script poses significant challenges due to the need for abundant annotated data and varying writing styles, materials, and degradations. The paper proposes a novel data synthesis pipeline that augments parametric generation with 3D rendering to build realistic and diverse training data for Old Turkic runiform script grapheme classification. Our approach synthesizes distance field variations of graphemes, applies parametric randomization, and renders them in simulated 3D scenes with varying textures, lighting, and environments. We train a Vision Transformer model on the synthesized data and evaluate its performance on the Kül Tegin inscription photographs. Experimental results demonstrate the effectiveness of our approach, with the model achieving high accuracy without seeing any real-world data during training. We finally discuss avenues for future research. Our work provides a promising direction to overcome data scarcity in Old Turkic runiform script.

Anahtar Kelimeler

Kaynakça

  1. Akenine-Moller, T. et al. (2019). Real-time rendering. London-New York: AK Peters/CRC Press.
  2. AlKendi, W. et al. (2024). Advancements and Challenges in Handwritten Text Recognition: A Comprehensive Survey. Journal of Imaging, 10(1), 18.
  3. Blender Foundation. (2024). Blender - A 3D modelling and rendering package. (Retrieved from www.blender.org)
  4. Bradski, G. et al. (2000). OpenCV. Dr. Dobb’s Journal of Software Tools, 3(2).
  5. Buslaev, A. et al. (2020). Albumentations: fast and flexible image augmentations. Information, 11(2), 125.
  6. Celso M. de Melo et al. (2022). Next-generation deep learning based on simulators and synthetic data. Trends in Cognitive Sciences, 26(2), 174–187.
  7. Chefer, H. et al. (2021). Transformer interpretability beyond attention visualization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 782–791.
  8. Choudhary, T. et al. (2020). A comprehensive survey on model compression and acceleration. Artificial Intelligence Review, 53, 5113–5155.

Ayrıntılar

Birincil Dil

Türkçe

Konular

Dilbilim (Diğer)

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

21 Temmuz 2024

Gönderilme Tarihi

15 Haziran 2024

Kabul Tarihi

14 Temmuz 2024

Yayımlandığı Sayı

Yıl 2024 Cilt: 8 Sayı: 2

Kaynak Göster

APA
Derin, M. O., & Uçar, E. (2024). Augmenting parametric data synthesis with 3D simulation for OCR on Old Turkic runiform inscriptions: A case study of the Kül Tegin inscription. Journal of Old Turkic Studies, 8(2), 278-301. https://doi.org/10.35236/jots.1501797

Cited By