Research Article

ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION

Volume: 17 Number: 2 November 8, 2021
EN TR

ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION

Abstract

In this study, we focus on the effect of word positions in unsupervised, graph-based keyword extraction. To this aim, we discuss the performance of four node-weighting procedures, namely Word Position (WP), Word Position Bidirectional (WPB), Sentence Position (SP), and Sentence Position Bidirectional (SPB). WP assigns higher weights to words that appear at the beginning of a text. WPB assigns higher weights to words that appear either at the beginning or end of a text. SP assigns higher weights to words that appear in the very first sentences of a text. SPB assigns higher weights to words that appear in sentences that are either close to the beginning or end of a text. Experiments conducted on six benchmark datasets show that WP and SP do not statistically differ. However, for datasets whose keywords appear early in the text WP performs better than SP with no statistical difference, while for datasets where keywords are evenly distributed in text SP statistically performs better than WP.

Keywords

Supporting Institution

TÜBİTAK

Project Number

117E566

References

  1. Armouty, B., & Tedmori, S. (2019). “Automated Keyword Extraction using Support Vector Machine from Arabic News Documents”. In 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), IEEE, 342-346. doi:10.1109/JEEIT.2019.8717420.
  2. Anju, R. C., Ramesh, S. H., & Rafeeque, P. C. (2018). “Keyphrase and Relation Extraction from Scientific Publications”. In Damodar Reddy Edla, Pawan Lingras, and Venkatanareshbabu K. (Eds), Advances in Machine Learning and Data Science: Recent Achievements and Research Directives (pp. 113-120). Vol. 705, Singapore, Springer.
  3. Augenstein, I., Das, M., Riedel, S., Vikraman, L., & McCallum, A. (2017). “Semeval 2017 Task 10: ScienceIE-Extracting Keyphrases and Relations from Scientific Publications”. arXiv preprint arXiv:1704.02853. Retrieved from https://arxiv.org/pdf/1704.02853.pdf
  4. Azcarraga, A., Liu, M. D., & Setiono, R. (2012, June). “Keyword extraction using backpropagation neural networks and rule extraction”. In the 2012 International Joint Conference on Neural Networks (IJCNN), IEEE, 1-7. doi:10.1109/IJCNN.2012.6252618.
  5. Beliga, S. (2014). “Keyword Extraction: A Review of Methods and Approaches”. University of Rijeka, Department of Informatics, Rijeka, 1-9.
  6. Bellaachia, A., & Al-Dhelaan, M. (2014). “HG-Rank: A Hypergraph-based Keyphrase Extraction for Short Documents in Dynamic Genre”. In #MSM, #Microposts2014, 4th Workshop on Making Sense of Micropost, 42-49.
  7. Biswas, S. K. (2019). “Keyword Extraction from Tweets Using Weighted Graph”. In Pradeep Kumar Mallick, Valentina Emilia Balas, Akash Kumar Bhoi, and Ahmed F. Zobaa (Eds.), Cognitive Informatics and Soft Computing: Proceeding of CISC 2017 (pp. 475-483). Vol 768, Singapore, Springer.
  8. Biswas, S. K., Bordoloi, M., & Shreya, J. (2018). “A graph based keyword extraction model using collective node weight”. Expert Systems with Applications, Vol. 97, 51-59. doi:10.1016/j.eswa.2017.12.025.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

November 8, 2021

Submission Date

November 28, 2020

Acceptance Date

June 25, 2021

Published in Issue

Year 2021 Volume: 17 Number: 2

APA
Kabasakal, O., & Mutlu, A. (2021). ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION. Journal of Naval Sciences and Engineering, 17(2), 217-239. https://izlik.org/JA22GN66JT
AMA
1.Kabasakal O, Mutlu A. ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION. JNSE. 2021;17(2):217-239. https://izlik.org/JA22GN66JT
Chicago
Kabasakal, Osman, and Alev Mutlu. 2021. “ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION”. Journal of Naval Sciences and Engineering 17 (2): 217-39. https://izlik.org/JA22GN66JT.
EndNote
Kabasakal O, Mutlu A (November 1, 2021) ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION. Journal of Naval Sciences and Engineering 17 2 217–239.
IEEE
[1]O. Kabasakal and A. Mutlu, “ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION”, JNSE, vol. 17, no. 2, pp. 217–239, Nov. 2021, [Online]. Available: https://izlik.org/JA22GN66JT
ISNAD
Kabasakal, Osman - Mutlu, Alev. “ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION”. Journal of Naval Sciences and Engineering 17/2 (November 1, 2021): 217-239. https://izlik.org/JA22GN66JT.
JAMA
1.Kabasakal O, Mutlu A. ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION. JNSE. 2021;17:217–239.
MLA
Kabasakal, Osman, and Alev Mutlu. “ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION”. Journal of Naval Sciences and Engineering, vol. 17, no. 2, Nov. 2021, pp. 217-39, https://izlik.org/JA22GN66JT.
Vancouver
1.Osman Kabasakal, Alev Mutlu. ON THE EFFECT OF WORD POSITIONS IN GRAPH-BASED KEYWORD EXTRACTION. JNSE [Internet]. 2021 Nov. 1;17(2):217-39. Available from: https://izlik.org/JA22GN66JT