<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>kojose</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Kocaeli Journal of Science and Engineering</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">2667-484X</issn>
                                                                                            <publisher>
                    <publisher-name>Kocaeli University</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.34088/kojose.871873</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Computer Software</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Bilgisayar Yazılımı</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                                                            <article-title>Named Entity Recognition in Turkish Bank Documents</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0003-1187-5147</contrib-id>
                                                                <name>
                                    <surname>Kabasakal</surname>
                                    <given-names>Osman</given-names>
                                </name>
                                                                    <aff>KOCAELİ ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0003-0547-0653</contrib-id>
                                                                <name>
                                    <surname>Mutlu</surname>
                                    <given-names>Alev</given-names>
                                </name>
                                                                    <aff>KOCAELİ ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20211130">
                    <day>11</day>
                    <month>30</month>
                    <year>2021</year>
                </pub-date>
                                        <volume>4</volume>
                                        <issue>2</issue>
                                        <fpage>86</fpage>
                                        <lpage>92</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20210131">
                        <day>01</day>
                        <month>31</month>
                        <year>2021</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20210413">
                        <day>04</day>
                        <month>13</month>
                        <year>2021</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2018, Kocaeli Journal of Science and Engineering</copyright-statement>
                    <copyright-year>2018</copyright-year>
                    <copyright-holder>Kocaeli Journal of Science and Engineering</copyright-holder>
                </permissions>
            
                                                                                                                        <abstract><p>Named Entity Recognition (NER) is the process of automatically recognizing entity names such as person, organization, and date in a document.  In this study, we focus on bank documents written in Turkish and propose a Conditional Random Fields (CRF) model to extract named entities. The main contribution of this study is twofold: (i) we propose domain-specific features to extract entity names such as law, regulation, and reference which frequently appear in bank documents; and (ii) we contribute to NER research in Turkish document which is not as mature as other languages such as English and German. Experimental results based on 10-fold cross validation conducted on 551 real-life, anonymized bank documents show the proposed CRF-NER model achieves 0.962 micro average F1 score. More specifically, F1 score for the identification of law names is 0.979, regulation name is 0.850, and article no is 0.850.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>Bank Document</kwd>
                                                    <kwd>  Conditional Random Fields</kwd>
                                                    <kwd>  Named Entity Recognition</kwd>
                                                    <kwd>  Natural Language Processing</kwd>
                                                    <kwd>  Turkish Documents</kwd>
                                            </kwd-group>
                            
                                                                                                                                                <funding-group specific-use="FundRef">
                    <award-group>
                                                    <funding-source>
                                <named-content content-type="funder_name">TÜBİTAK</named-content>
                            </funding-source>
                                                                            <award-id>5190074</award-id>
                                            </award-group>
                </funding-group>
                                </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">[1]  Nagy I., Berend G., Vincze V., 2011. Noun compound and named entity recognition and their usability in keyphrase extraction. International Conference Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">[2] 	Rodrigo A., Perez-Iglesias J., Penas A., Garrido G., Araujo L., 2013. Answering questions about European legislation. Expert Systems with Applications, 40(15), pp. 5811-5816.</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">[3] 	Cao T. H., Tang T. M., Chau C. K., 2012. Text clustering with named entities: a model, experimentation and realization. In Data mining: Foundations and intelligent paradigms, Springer, Berlin, Heidelberg.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">[4] 	Hassel M., 2003. Exploitation of named entities in automatic text summarization for Swedish. 14th Nordic Conference on Computational Linguistics, Reykjavik, Iceland, 30-31 May.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">[5] 	Grishman R., Sundheim B. M., 1996. Message Understanding Conference – 6: A brief history. The 16th International Conference on Computational Linguistics, Copenhagen, Denmark, 5-9 August.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">[6] 	Black W. J., Rinaldi F., Mowatt D., 1998. FACILE: Description of the NE System Used for MUC-7. 7th Message Understanding Conference, Fairfax, Virginia, 29 April – 1 May.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">[7] 	Aone C., Halverson L., Hampton T., Ramos-Santacruz M., 1998. SRA: Description of the IE2 system used for MUC-7. 7th Message Understanding Conference, Fairfax, Virginia, 29 April – 1 May.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">[8] 	Nadeau D., Turney P. D., Matwin S., 2006. Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity. 19th Canadian Conference on Artificial Intelligence, Quebec, Canada, 7-9 June.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">[9] 	Özkaya S., Diri B., 2011. Named entity recognition by conditional random fields from Turkish informal texts. 19th Signal Processing and Communications Applications Conference, Antalya, 20-22 April.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">[10] 	Lin W., Ji D., Lu Y., 2017. Disorder recognition in clinical texts using multi-label structured SVM. BMC bioinformatics, 18(1), 75, pp. 1-11.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">11] 	Zhang M., Geng G., Chen J., 2020. Semi-Supervised Bidirectional Long Short-Term Memory and Conditional Random Fields Model for Named-Entity Recognition Using Embeddings from Language Models Representations. Entropy, 22(2), pp. 252.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">[12] 	Zhu Q., Li X., Conesa A., Pereira C., 2018. GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text. Bioinformatics, 34(9), pp. 1547-1554.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">[13] 	Korvigo I., Holmatov M., Zaikovskii A., Skoblov M., 2018. Putting hands to rest: efficient deep CNN-RNN architecture for chemical named entity recognition with no hand-crafted rules. Journal of cheminformatics, 10(1), pp. 1-10.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">[14] 	Li J., Sun A., Han J., Li C., 2020. A survey on deep learning for named entity recognition. IEEE Transactions on Knowledge and Data Engineering.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">[15] 	Chen Y., Zhou C., Li T., Wu H., Zhao X., Ye K., Liao J., 2019. Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training. Journal of Biomedical Informatics, 96, pp. 103252.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">[16] 	Leitner E., Rehm G., Moreno-Schneider J., 2019. Fine-grained Named Entity Recognition in Legal Documents. 15th International Conference on Semantic Systems, Karlsruhe, Germany, 9-12 September.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">[17] 	Leaman R., Wei C. H., Lu Z., 2015. tmChem: a high performance approach for chemical named entity recognition and normalization. Journal of cheminformatics, 7(S1), S3.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">[18] 	Cucerzan S., Yarowsky D., 1999. Language independent named entity recognition combining morphological and contextual evidence. In 1999 joint SIGDAT conference on empirical methods in natural language processing and very large corpora.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">[19] 	Küçük D., 2009. Named entity recognition experiments on Turkish texts. In International Conference on Flexible Query Answering Systems. Springer, Berlin, Heidelberg.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">[20] 	Cekınel R. F., Ağriman M., Karagöz P., Yilmaz B., 2019. Named Entity Recognition with Conditional Random Fields on Turkish News Dataset: Revisiting the Features. 27th Signal Processing and Communications Applications Conference, Sivas, Turkey, 24-26 April.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">[21] 	Farmakiotou D., Karkaletsis V., Koutsias J., Sigletos G., Spyropoulos C. D., Stamatopoulos P., 2000. Rule-based named entity recognition for Greek financial texts. In Proceedings of the Workshop on Computational lexicography and Multimedia Dictionaries (COMLEX 2000).</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">[22] 	Sheikh M., Conlon S., 2012. A rule-based system to extract financial information. Journal of Computer Information Systems, 52(4), pp. 10-19.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">[23] 	Wang S., Xu R., Liu B., Gui L., Zhou Y., 2014. Financial named entity recognition based on conditional random fields and information entropy. In 2014 International Conference on Machine Learning and Cybernetics, Lanzhou, China, 13-16 July</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">[24] 	Alvarado J. C. S., Verspoor K., Baldwin T., 2015. Domain adaption of named entity recognition to support credit risk assessment. In Proceedings of the Australasian Language Technology Association Workshop 2015</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">[25] 	Bayraktar O., Temizel T. T., 2008. Person name extraction from Turkish financial news text using local grammar-based approach. In 2008 23rd International Symposium on Computer and Information Sciences (pp. 1-4). IEEE.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">[26] 	Leitner E., Rehm G., Moreno-Schneider J., 2019. Fine-grained Named Entity Recognition in Legal Documents. In International Conference on Semantic Systems (pp. 272-287). Springer, Cham.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">[27] 	Vardhan H., Surana N., Tripathy B. K., 2020. Named-Entity Recognition for Legal Documents. In International Conference on Advanced Machine Learning Technologies and Applications (pp. 469-479), Jaipur, India. Springer, Singapore, 13-15 February.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
