Araştırma Makalesi

Author Identification for Turkish Texts

Cilt: 1 Sayı: 7 1 Ağustos 2007
PDF İndir
EN TR

Author Identification for Turkish Texts

Öz

The main concern of author identification is to define an appropriate characterization of documents that captures the writing style of authors. The most important approaches to computer-based author identification are exclusively based on lexical measures. In this paper we presented a fully automated approach to the identification of the authorship of unrestricted text by adapting a set of style markers to the analysis of the text. In this study, 35 style markers were applied to each author. By using our method, the author of a text can be identified by using the style markers that characterize a group of authors. The author group consists of 20 different writers. Author features including style markers were derived together with different machine learning algorithms. By using our method we have obtained a success rate of 80% in avarege.

Anahtar Kelimeler

Kaynakça

  1. A. Genkin, D. D. Lewis, and D. Madigan, Large-scale bayesian logistic regression for text categorization, 2004.
  2. B.Diri, M. F. Amasyal›, Automatic Author Detection for Turkish Text, ICANN/ICONIP’03 13th International Conference on Artificial Neural Network and 10th International Conference on Neural Information Processing, 2003.
  3. B.Kessler, G. Nunberg, H.Schutze, Automatic Detection of Text Genre, Proc. of 35th Annual Meeting of the Association for Computational Linguistics (ACL/EACL’97), 32-38 1997.
  4. Chris Callison-Burch, Co-training for Statistical Machine Translation, Master’s thesis, University of Edinburgh, 2002.
  5. Christopher D. Manning and Hinrich Schütze, Foundations of Statistical Natural Language Processing, The MIT Press, 1999.
  6. D. Biber, Variations Across Speech and Writing, Cambridge University Press, 1988.
  7. D. I. Holmes, Stylometry: Its Origins, Development and Aspirations, presented to the Joint International Conference of the Association for Computers and the Humanities and the Association for Literary and Linguistic Computing, Queen’s University, Kingston, Ontario, 1997.
  8. D. Khmelev, Disputed authorship resolution using relative entropy for markov chain of letters in a text, In R. Baayen, editor, 4th Conference Int. Quantitative Linguistics Association, Prague, 2000.

Ayrıntılar

Birincil Dil

Türkçe

Konular

-

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

1 Ağustos 2007

Gönderilme Tarihi

1 Şubat 2014

Kabul Tarihi

-

Yayımlandığı Sayı

Yıl 2007 Cilt: 1 Sayı: 7

Kaynak Göster

APA
Görür, T. T. A. K., & Taş, T. (2007). Author Identification for Turkish Texts. Cankaya University Journal of Arts and Sciences, 1(7), 151-161. https://izlik.org/JA33DU77ZC
AMA
1.Görür TTAK, Taş T. Author Identification for Turkish Texts. Cankaya University Journal of Arts and Sciences. 2007;1(7):151-161. https://izlik.org/JA33DU77ZC
Chicago
Görür, Tufan Taş, Abdul Kadir, ve Tufan Taş. 2007. “Author Identification for Turkish Texts”. Cankaya University Journal of Arts and Sciences 1 (7): 151-61. https://izlik.org/JA33DU77ZC.
EndNote
Görür TTAK, Taş T (01 Ağustos 2007) Author Identification for Turkish Texts. Cankaya University Journal of Arts and Sciences 1 7 151–161.
IEEE
[1]T. T. A. K. Görür ve T. Taş, “Author Identification for Turkish Texts”, Cankaya University Journal of Arts and Sciences, c. 1, sy 7, ss. 151–161, Ağu. 2007, [çevrimiçi]. Erişim adresi: https://izlik.org/JA33DU77ZC
ISNAD
Görür, Tufan Taş, Abdul Kadir - Taş, Tufan. “Author Identification for Turkish Texts”. Cankaya University Journal of Arts and Sciences 1/7 (01 Ağustos 2007): 151-161. https://izlik.org/JA33DU77ZC.
JAMA
1.Görür TTAK, Taş T. Author Identification for Turkish Texts. Cankaya University Journal of Arts and Sciences. 2007;1:151–161.
MLA
Görür, Tufan Taş, Abdul Kadir, ve Tufan Taş. “Author Identification for Turkish Texts”. Cankaya University Journal of Arts and Sciences, c. 1, sy 7, Ağustos 2007, ss. 151-6, https://izlik.org/JA33DU77ZC.
Vancouver
1.Tufan Taş, Abdul Kadir Görür, Tufan Taş. Author Identification for Turkish Texts. Cankaya University Journal of Arts and Sciences [Internet]. 01 Ağustos 2007;1(7):151-6. Erişim adresi: https://izlik.org/JA33DU77ZC