A Turkish Word Frequency Tool: LexiTR Frequency
Abstract
Word frequency is a fundamental concept in linguistics, computational linguistics, natural language processing (NLP), and language education, and it plays a critical role in understanding the characteristics and usage patterns of a word. This study introduces the "Turkish Word Frequency Tool" (TWFT), developed as part of the LexiTR Project, along with its features. TWFT is based on a balanced corpus of over 193 million words drawn from four distinct text types: academic, social media, fictional, and informative texts. TWFT serves as a scalable online platform that lets researchers examine word usage trends across these text types. It supports comprehensive analyses through real-time querying, graphical data representation, and both raw and normalized frequency values. It also provides API support, presenting word frequency information in a structured format. By filling a significant gap in the existing literature, TWFT aims to establish a consistent, transparent, and comprehensive foundation for linguistic research and NLP applications.
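To illustrate the distinction between raw and normalized frequency values mentioned in the abstract, the following is a minimal sketch of the common per-million-words normalization. It assumes a per-million basis, which the abstract does not specify; the function name, the dictionary keys, and all counts and sub-corpus sizes are hypothetical and are not taken from TWFT.

```python
def per_million(raw_count: int, corpus_size: int) -> float:
    """Convert a raw count into occurrences per million words."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return raw_count * 1_000_000 / corpus_size


# Hypothetical raw counts of one word and hypothetical sub-corpus sizes
# (illustrative numbers only, not TWFT data).
raw_counts = {"academic": 1_200, "social_media": 4_500}
sizes = {"academic": 40_000_000, "social_media": 60_000_000}

# Normalizing makes counts from differently sized sub-corpora comparable.
normalized = {t: per_million(raw_counts[t], sizes[t]) for t in raw_counts}

for text_type, value in normalized.items():
    print(f"{text_type}: {value:.1f} per million words")
```

Raw counts depend on how large each sub-corpus is, so a normalized value of this kind is what allows a word's usage to be compared across text types of different sizes.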
Details
Primary Language
English
Subjects
Turkish Education
Section
Research Article
Publication Date
April 30, 2025
Submission Date
February 9, 2025
Acceptance Date
March 17, 2025
Published in Issue
Year 2025 Volume: 13 Issue: 2
APA
Sezer, T., & Karadağ, Ö. (2025). A Turkish Word Frequency Tool: LexiTR Frequency. Ana Dili Eğitimi Dergisi, 13(2), 266-276. https://doi.org/10.16916/aded.1636416
