Research Article

Named Entity Recognition in Turkish Bank Documents

Volume: 4 Issue: 2, 30 November 2021

Abstract

Named Entity Recognition (NER) is the process of automatically recognizing entity names such as person, organization, and date in a document. In this study, we focus on bank documents written in Turkish and propose a Conditional Random Fields (CRF) model to extract named entities. The contribution of this study is twofold: (i) we propose domain-specific features to extract entity types such as law, regulation, and reference, which frequently appear in bank documents; and (ii) we contribute to NER research on Turkish documents, which is not as mature as research on other languages such as English and German. Experimental results based on 10-fold cross-validation on 551 real-life, anonymized bank documents show that the proposed CRF-NER model achieves a micro-averaged F1 score of 0.962. More specifically, the F1 score is 0.979 for law names, 0.850 for regulation names, and 0.850 for article numbers.
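
The micro-averaged F1 score reported above pools true positives, false positives, and false negatives across all entity types before computing precision and recall, so frequent types weigh more than rare ones. The following is a minimal sketch of that computation, not the authors' evaluation code. It assumes token-level scoring and an outside tag `O`; the label names `LAW`, `REGULATION`, and `ARTICLE_NO` are hypothetical stand-ins, and the paper may score at the entity level instead.

```python
def micro_f1(gold, pred, outside="O"):
    """Token-level micro-averaged F1 over all entity labels.

    `gold` and `pred` are equal-length lists of labels. Tokens whose
    label is `outside` count as non-entities (an assumption of this sketch).
    """
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        if p != outside and p == g:
            tp += 1          # predicted an entity label and it is correct
        else:
            if p != outside:
                fp += 1      # predicted an entity label that is wrong
            if g != outside:
                fn += 1      # missed (or mislabeled) a gold entity token
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy example with hypothetical labels (not data from the paper).
gold = ["LAW", "LAW", "O", "REGULATION", "ARTICLE_NO", "O"]
pred = ["LAW", "LAW", "O", "O", "ARTICLE_NO", "LAW"]
print(micro_f1(gold, pred))
```

In the toy example there are three correct entity tokens, one spurious prediction, and one missed token, so pooled precision and recall are both 3/4. In a 10-fold setup, such counts would typically be pooled or averaged over the folds; which of the two the study uses is not stated in the abstract.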

Project Number

5190074

Details

Primary Language

English

Subjects

Computer Software

Section

Research Article

Publication Date

30 November 2021

Submission Date

31 January 2021

Acceptance Date

13 April 2021

Published in Issue

Year 2021 Volume: 4 Issue: 2

Cite This Article

APA
Kabasakal, O., & Mutlu, A. (2021). Named Entity Recognition in Turkish Bank Documents. Kocaeli Journal of Science and Engineering, 4(2), 86-92. https://doi.org/10.34088/kojose.871873
AMA
1. Kabasakal O, Mutlu A. Named Entity Recognition in Turkish Bank Documents. KOJOSE. 2021;4(2):86-92. doi:10.34088/kojose.871873
Chicago
Kabasakal, Osman, and Alev Mutlu. 2021. “Named Entity Recognition in Turkish Bank Documents”. Kocaeli Journal of Science and Engineering 4 (2): 86-92. https://doi.org/10.34088/kojose.871873.
EndNote
Kabasakal O, Mutlu A (01 November 2021) Named Entity Recognition in Turkish Bank Documents. Kocaeli Journal of Science and Engineering 4 2 86–92.
IEEE
[1] O. Kabasakal and A. Mutlu, “Named Entity Recognition in Turkish Bank Documents”, KOJOSE, vol. 4, no. 2, pp. 86–92, Nov. 2021, doi: 10.34088/kojose.871873.
ISNAD
Kabasakal, Osman - Mutlu, Alev. “Named Entity Recognition in Turkish Bank Documents”. Kocaeli Journal of Science and Engineering 4/2 (01 November 2021): 86-92. https://doi.org/10.34088/kojose.871873.
JAMA
1. Kabasakal O, Mutlu A. Named Entity Recognition in Turkish Bank Documents. KOJOSE. 2021;4:86–92.
MLA
Kabasakal, Osman, and Alev Mutlu. “Named Entity Recognition in Turkish Bank Documents”. Kocaeli Journal of Science and Engineering, vol. 4, no. 2, November 2021, pp. 86-92, doi:10.34088/kojose.871873.
Vancouver
1. Osman Kabasakal, Alev Mutlu. Named Entity Recognition in Turkish Bank Documents. KOJOSE. 01 November 2021;4(2):86-92. doi:10.34088/kojose.871873