EN
Named Entity Recognition in Turkish Bank Documents
Öz
Named Entity Recognition (NER) is the process of automatically recognizing entity names such as person, organization, and date in a document. In this study, we focus on bank documents written in Turkish and propose a Conditional Random Fields (CRF) model to extract named entities. The main contribution of this study is twofold: (i) we propose domain-specific features to extract entity names such as law, regulation, and reference which frequently appear in bank documents; and (ii) we contribute to NER research in Turkish document which is not as mature as other languages such as English and German. Experimental results based on 10-fold cross validation conducted on 551 real-life, anonymized bank documents show the proposed CRF-NER model achieves 0.962 micro average F1 score. More specifically, F1 score for the identification of law names is 0.979, regulation name is 0.850, and article no is 0.850.
Anahtar Kelimeler
- Bank Document
- Conditional Random Fields
- Named Entity Recognition
- Natural Language Processing
- Turkish Documents
Proje Numarası
5190074
Kaynakça
- [1] Nagy I., Berend G., Vincze V., 2011. Noun compound and named entity recognition and their usability in keyphrase extraction. International Conference Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September.
- [2] Rodrigo A., Perez-Iglesias J., Penas A., Garrido G., Araujo L., 2013. Answering questions about European legislation. Expert Systems with Applications, 40(15), pp. 5811-5816.
- [3] Cao T. H., Tang T. M., Chau C. K., 2012. Text clustering with named entities: a model, experimentation and realization. In Data mining: Foundations and intelligent paradigms, Springer, Berlin, Heidelberg.
- [4] Hassel M., 2003. Exploitation of named entities in automatic text summarization for Swedish. 14th Nordic Conference on Computational Linguistics, Reykjavik, Iceland, 30-31 May.
- [5] Grishman R., Sundheim B. M., 1996. Message Understanding Conference – 6: A brief history. The 16th International Conference on Computational Linguistics, Copenhagen, Denmark, 5-9 August.
- [6] Black W. J., Rinaldi F., Mowatt D., 1998. FACILE: Description of the NE System Used for MUC-7. 7th Message Understanding Conference, Fairfax, Virginia, 29 April – 1 May.
- [7] Aone C., Halverson L., Hampton T., Ramos-Santacruz M., 1998. SRA: Description of the IE2 system used for MUC-7. 7th Message Understanding Conference, Fairfax, Virginia, 29 April – 1 May.
- [8] Nadeau D., Turney P. D., Matwin S., 2006. Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity. 19th Canadian Conference on Artificial Intelligence, Quebec, Canada, 7-9 June.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Bilgisayar Yazılımı
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
30 Kasım 2021
Gönderilme Tarihi
31 Ocak 2021
Kabul Tarihi
13 Nisan 2021
Yayımlandığı Sayı
Yıl 2021 Cilt: 4 Sayı: 2
APA
Kabasakal, O., & Mutlu, A. (2021). Named Entity Recognition in Turkish Bank Documents. Kocaeli Journal of Science and Engineering, 4(2), 86-92. https://doi.org/10.34088/kojose.871873
AMA
1.Kabasakal O, Mutlu A. Named Entity Recognition in Turkish Bank Documents. KOJOSE. 2021;4(2):86-92. doi:10.34088/kojose.871873
Chicago
Kabasakal, Osman, ve Alev Mutlu. 2021. “Named Entity Recognition in Turkish Bank Documents”. Kocaeli Journal of Science and Engineering 4 (2): 86-92. https://doi.org/10.34088/kojose.871873.
EndNote
Kabasakal O, Mutlu A (01 Kasım 2021) Named Entity Recognition in Turkish Bank Documents. Kocaeli Journal of Science and Engineering 4 2 86–92.
IEEE
[1]O. Kabasakal ve A. Mutlu, “Named Entity Recognition in Turkish Bank Documents”, KOJOSE, c. 4, sy 2, ss. 86–92, Kas. 2021, doi: 10.34088/kojose.871873.
ISNAD
Kabasakal, Osman - Mutlu, Alev. “Named Entity Recognition in Turkish Bank Documents”. Kocaeli Journal of Science and Engineering 4/2 (01 Kasım 2021): 86-92. https://doi.org/10.34088/kojose.871873.
JAMA
1.Kabasakal O, Mutlu A. Named Entity Recognition in Turkish Bank Documents. KOJOSE. 2021;4:86–92.
MLA
Kabasakal, Osman, ve Alev Mutlu. “Named Entity Recognition in Turkish Bank Documents”. Kocaeli Journal of Science and Engineering, c. 4, sy 2, Kasım 2021, ss. 86-92, doi:10.34088/kojose.871873.
Vancouver
1.Osman Kabasakal, Alev Mutlu. Named Entity Recognition in Turkish Bank Documents. KOJOSE. 01 Kasım 2021;4(2):86-92. doi:10.34088/kojose.871873