Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs

Mehmet Oguz Derin; Erdem Uçar

doi:10.35236/jots.1762980

Abstract

Old Uyghur manuscripts remain largely inaccessible because no optical character recognition (OCR) system aligned with modern scholarly practice exists for them. To address this, we present a work‑in‑progress OCR approach based on fine-tuning vision–language models (VLMs) for Old Uyghur transliteration. Specifically, LLaMA‑3.2‑11B‑Vision was fine‑tuned with Unsloth, using 4‑bit quantization and LoRA, on 525 manually annotated pages from Radloff and Malov’s edition of Altun Yaruk Sudur, following Unicode‑based transliteration guidelines. On a held‑out test set of 16 pages from one bölük (part) of the fifth tägzinč, the approach yielded a character error rate (CER) of 5.46% and a normalized edit distance (NED) of 0.286. These results demonstrate feasibility, but the approach is limited by binarized scans, a single woodblock style, a small training set, and the lack of positional supervision. The findings therefore provide a baseline and a reproducible path toward robust Old Uyghur OCR.
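
The abstract reports CER and NED but does not state their formulas. The following is a minimal sketch, assuming CER is the Levenshtein distance divided by the reference length and NED is the same distance divided by the length of the longer string. Both are common conventions, not definitions confirmed by this page, so the values it produces need not match how the reported figures were computed. The function names and sample strings are illustrative only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `a` into `b`."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate (assumed definition): distance / len(reference)."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return levenshtein(reference, hypothesis) / len(reference)


def ned(reference: str, hypothesis: str) -> float:
    """Normalized edit distance (assumed definition): distance / longer length."""
    longest = max(len(reference), len(hypothesis))
    if longest == 0:
        return 0.0
    return levenshtein(reference, hypothesis) / longest


if __name__ == "__main__":
    # Illustrative strings only, not taken from the test set.
    print(cer("altun", "altin"))
    print(ned("altun", "altunn"))
```

Under these assumed definitions, CER can exceed 1 when the hypothesis is much longer than the reference, whereas NED always stays between 0 and 1.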

References

Chung, Y. H. M., & Choi, D. (2025). Fine‑tuning vision‑language models as OCR systems for low‑resource languages: A case study of Manchu. arXiv:2507.06761.
Derin, M. O., & Harada, T. (2021). Universal Dependencies for Old Turkish. Proceedings of the Fifth Workshop on Universal Dependencies (pp. 129–141). Association for Computational Linguistics.
Han, D. et al. (2023). Unsloth: Finetune LLMs 2× faster with 70% less memory. https://github.com/unslothai/unsloth
Radlov, V. V., & Malov, S. E. (1913–1917). Suvarṇaprabhāsa (Sutra zolotogo bleska), Tekst’ Uygurskoy Redaktsiy, izdali I–IV. St. Petersburg: Imperatorskoy Akad. Nauk.
Uçar, E. (2023). Altun Yaruq Sudur’un St. Petersburg Yazması’ndaki Yapraklar ile Radlov-Malov Neşrindeki Sayfaların Tekabül Listesi (Deneme). Babür, 2(2), 35–63.
Uçar, E., & Derin, M. O. (2025). Die digitale Zukunft der sogdischen und altuigurischen Schrift: Entwicklung und Etablierung von Standards für Transliteration, Transkription und Unicode‑Kodierung. Journal of Old Turkic Studies, 9(2), 535–552.
Zieme, P. (2015). Local Literatures: Uighur. In J. A. Silk et al. (Eds.), Brill’s Encyclopedia of Buddhism. Vol. I: Literature and Languages (pp. 871–882). Leiden-Boston: Brill.

Details

Primary Language

English

Subjects

Linguistics (Other)

Section

Research Article

Authors

Mehmet Oguz Derin*
0000-0002-6264-3509
Türkiye

Erdem Uçar
0000-0002-0039-9619
Germany

Early View Date

17 August 2025

Publication Date

18 August 2025

Submission Date

12 August 2025

Acceptance Date

15 August 2025

Published in Issue

Year 2025, Volume: 9, Issue: 2

DOI

https://doi.org/10.35236/jots.1762980

IZ

https://izlik.org/JA59JZ76TT

Cite

APA

Derin, M. O., & Uçar, E. (2025). Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs. Journal of Old Turkic Studies, 9(2), 313-318. https://doi.org/10.35236/jots.1762980

AMA

1. Derin MO, Uçar E. Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs. JOTS. 2025;9(2):313-318. doi:10.35236/jots.1762980

Chicago

Derin, Mehmet Oguz, and Erdem Uçar. 2025. “Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs”. Journal of Old Turkic Studies 9 (2): 313-18. https://doi.org/10.35236/jots.1762980.

EndNote

Derin MO, Uçar E (01 August 2025) Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs. Journal of Old Turkic Studies 9 2 313–318.

IEEE

[1] M. O. Derin and E. Uçar, “Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs”, JOTS, vol. 9, no. 2, pp. 313–318, Aug. 2025, doi: 10.35236/jots.1762980.

ISNAD

Derin, Mehmet Oguz - Uçar, Erdem. “Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs”. Journal of Old Turkic Studies 9/2 (01 August 2025): 313-318. https://doi.org/10.35236/jots.1762980.

JAMA

1. Derin MO, Uçar E. Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs. JOTS. 2025;9:313–318.

MLA

Derin, Mehmet Oguz, and Erdem Uçar. “Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs”. Journal of Old Turkic Studies, vol. 9, no. 2, August 2025, pp. 313-8, doi:10.35236/jots.1762980.

Vancouver

1. Mehmet Oguz Derin, Erdem Uçar. Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs. JOTS. 01 August 2025;9(2):313-8. doi:10.35236/jots.1762980