Research Article

Performance Evaluation of the Extractive Methods in Automatic Text Summarization Using Medical Papers

Volume: 9 Number: 4 December 31, 2023
TR EN

Performance Evaluation of the Extractive Methods in Automatic Text Summarization Using Medical Papers

Abstract

The rapid advancement of technology has resulted in a surge in the volume of digital data available. This situation creates a problem for users who need assistance locating specific information inside this massive data collection, resulting in a time-consuming process. Automatic Text Summarizing systems have been developed as a more effective solution to conventional summary techniques to address this issue and improve users' access to relevant information. It is well known that, because of their busy schedules, researchers in the field of health sciences find it challenging to keep up with the most recent literature. The goal of this study is to generate comprehensive summaries of Turkish-language scientific papers in the field of health sciences. Although abstracts are already present in scientific papers, more thorough summaries are still required. To the best of our knowledge, no previous attempt has been made to automatically summarize academic papers on health in the Turkish language. For this, a dataset of 105 Turkish papers from DergiPark was collected. Term Frequency, Term Frequency-Inverse Document Frequency, Latent Semantic Analysis, TextRank, and Latent Dirichlet Allocation algorithms were chosen as extractive text summarization methods due to their frequent usage in this field. The performance of the text summarization models was evaluated using Recall, Precision, and F-score metrics, and the algorithms gave satisfying results for Turkish.

Keywords

automatic text summarization; extractive method; scientific papers; health sciences

References

  1. [1] J. P. Andersen, M. W. Nielsen, N. L. Simone, R. E. Lewiss, and R. Jagsi, “COVID-19 medical papers have fewer women first authors than expected,” eLife, vol. 9, June 2020. doi:10.7554/elife.58807
  2. [2] A. See, P. J. Liu, and C. D. Manning, “Get to the point: Summarization with pointer-generator networks,” arxiv.org, April 2017 [Online]. Available: arXiv, https://arxiv.org/abs/1704.04368. [Accessed: 15 Dec. 2023].
  3. [3} S. Narayan, S. B. Cohen, and M. Lapata, “Ranking sentences for extractive summarization with reinforcement learning,” arxiv.org, April 2018 [Online]. Available: arXiv, https://arxiv.org/abs/1802.08636. [Accessed: 15 Dec. 2023].
  4. [4] E. Erdağı, “Extractive based automatic text summarization in Turkish texts,” Ph.D. dissertation, Maltepe University, İstanbul, Turkey, 2023.
  5. [5] Ö. E. Gündoğdu and N. Duru, “Turkish text summarization and methods,” in Proc. of 18th Academic Computing Conference -AB 2016, Aydın, Turkey, January 30-February 5, 2016, pp. 69–76.
  6. [6] DergiPark, DergiPark Akademik [Online]. Available: DergiPark Akademik, https://dergipark.org.tr/tr/. [Accessed: 16 Dec. 2023].
  7. [7] J. Beel, B. Gipp, S. Langer, and C. Breitinger, “Research-paper recommender systems: A literature survey,” International Journal on Digital Libraries, vol. 17, no. 4, pp. 305–338, July 2015. doi:10.1007/s00799-015-0156-0
  8. [8] A. Güran, "Automatic text summarization system," Ph.D. dissertation, Yıldız Technical University, Istanbul, Turkey, 2013.
  9. [9] O. Kaynar, Y. Işık, Y. Görmez, and F. Demirkoparan “Genetic algorithm based sentence extraction for automatic text summarization,” Journal of Management Information Systems, vol. 3, no. 2, pp. 62-75, December 2017.
  10. [10] H. Torun and A. B. Inner, "Detecting similar news by summarizing Turkish news," in Proc. of 2018 26th Signal Processing and Communications Applications Conference (SIU), Izmir, Turkey, 2018, pp. 1-4. doi: 10.1109/SIU.2018.8404826
IEEE
[1]A. Kuş and Ç. İ. Acı, “Performance Evaluation of the Extractive Methods in Automatic Text Summarization Using Medical Papers”, GJES, vol. 9, no. 4, pp. 14–22, Dec. 2023, [Online]. Available: https://izlik.org/JA95MW86HM