A NAMED ENTITY RECOGNITION MODEL FOR TURKISH LECTURE NOTES IN HISTORY AND GEOGRAPHY DOMAINS
Abstract
Named entity recognition (NER) is an information extraction (IE) task that is in the scope of natural language processing (NLP) and text mining. Its extent and methods may differ between studies, but basically, it aims to detect expressions that indicates a person, location, organization etc. In this study, a NER structure is developed for Turkish lecture notes (for history and geography courses). Separately, this structure is a project that is specialized for an information extraction task. Besides, it also has an educational value, as the projected outcome from its execution is meaningful words or word groups from the content of input lecture notes, which can be used to construct glossary of terms structures for individual courses or course subjects. With these glossary of terms structures, it is aimed to detect expressions in the content of a lecture note that can be used for questions and support a test preparation process. In this document, general information about NER task and its scope is given; previous studies on the field are mentioned; the system developed in line with this study is introduced; success of the system is evaluated through experiment results and some thoughts for enhancement are shared.
Keywords
References
- Alfonseca, E., Manandhar S. (2002). “An unsupervised method for general named entity recognition and automated concept discovery”. In 1st International Conference on General WordNet.
- Cucerzan, S., Yarowsky, D. (1999). “Language independent named entity recognition combining morphological and contextual evidence”. Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. New Brunswick, NJ: Association for Computational Linguistics.
- Ertopçu, B., Kanburoğlu, A., Topsakal, O., Açıkgöz, O., Gürkan, A., Özenç, B., Çam, İ., Avar, B., Ercan, G., Yıldız, O. (2017). “A new approach for named entity recognition”. In: International Conference on Computer Science and Engineering (UBMK), Antalya, Turkey, 2017.
- Grishman, A., Sundheim, B. (1996). “Message Understanding Conference-6: a brief history”. In Proceedings of the 16th conference on Computational linguistics - Volume 1 (COLING '96), Vol. 1. Association for Computational Linguistics, Stroudsburg, PA, USA, 466-471.
- Jurafsky, D., Martin, J.H. (2009). “Speech and language processing (2nd Edition)”. Prentice-Hall, Inc., Upper Saddle River, NJ, USA.
- Küçük, D., Jacquet, G., Steinberger, R. (2014). “Named entity recognition on Turkish tweets”. In: Language Resources and Evaluation Conference, 2014.
- Küçük, D., Küçük, D., Arıcı, N. (2016). “A named entity recognition dataset for Turkish”. In: 24th Signal Processing and Communications Applications Conference (SIU), Zonguldak, Turkey, 2016.
- Küçük, D., Yazıcı, A. (2009). “Named entity recognition experiments on Turkish texts”. In Proceedings of the 8th International Conference on Flexible Query Answering Systems, FQAS ’09, pages 524–535, Berlin, Heidelberg. Springer-Verlag.
Details
Primary Language
English
Subjects
Computer Software
Journal Section
Research Article
Publication Date
September 15, 2019
Submission Date
July 26, 2018
Acceptance Date
April 3, 2019
Published in Issue
Year 2019 Volume: 7 Number: 3