Araştırma Makalesi

Helmholtz-Based Automatic Document Summarization

Cilt: 5 Sayı: 1 10 Ekim 2022
PDF İndir
TR EN

Helmholtz-Based Automatic Document Summarization

Öz

Nowadays, the use of internet networks and social media has allowed people to express and interpret their opinions about other people or institutions easily and clearly. With the increasing prevalence of this opportunity, a growing rich content emerges. As a result, the analysis of big data obtained from the internet, transforming it into meaningful information, and using it is a subject that has been studied intensively in recent years. In this process, automatic text summarization has become an important task. In this study, the Helmholtz-based extractive summarization method is presented to create an automatic text summarization system. BBC News data set was used to test the proposed method. In this data set, there are both original full-text documents and summary documents of these original documents produced by human summarizers. The similarity of the summary document produced by the proposed Helmholtz-based extractive text summarization method with the original summary in the BBC News data set was calculated using the Simhash text similarity algorithm. When the results are examined, summary documents can be produced with 38.9% simhash similarity rate with the proposed Helmholtz-based extractive summarization method. In the Experiments section, the results obtained with other third-party extractive summarization algorithms are also shared.

Anahtar Kelimeler

Kaynakça

  1. Lee, J. H., Park, S., Ahn, C. M., & Kim, D. (2009). Automatic generic document summarization based on non-negative matrix factorization. Information Processing & Management, 45(1), 20-34.
  2. Torres-Moreno, J. M. (2014). Automatic text summarization. John Wiley & Sons.
  3. Joshi A., Fidalgo E., Alegre E., Fernández-Robles L. 2019. SummCoder: An Unsupervised Framework for Extractive Text Summarization Based on Deep Auto-encoders. Expert Syst Appl., doi: 10.1016/j.eswa.2019.03.045.
  4. Cigir C., Kutlu M., Cicekli I. 2009. Generic text summarization for Turkish. 2009 24th International Symposium on Computer and Information Sciences (IEEE), pp: 224-229.
  5. Luhn, H. P. (1958). The automatic creation of literature abstracts. IBM Journal of research and development, 2(2), 159-165.
  6. A. R. Pal, D. Saha, “An approach to automatic text summarization using WordNet,” 2014 IEEE International Advance Computing Conference (IACC), India, pp. 1169-1173, 2014.
  7. D. Gunawan, S.H. Harahap, R.F. Rahmat, “Multi-document Summarization by using TextRank and Maximal Marginal Relevance for Text in Bahasa Indonesia,” 2019 International Conference on ICT for Smart Society (ICISS), Indonesia, pp. 1-5, 2019.
  8. Al-Sabahi, Kamal & Zuping, Zhang & Nadher, Mohammed. (2018). A Hierarchical Structured Self-Attentive Model for Extractive Document Summarization (HSSAS). IEEE Access. PP. 1-1. 10.1109/ACCESS.2018.2829199.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

10 Ekim 2022

Gönderilme Tarihi

1 Temmuz 2022

Kabul Tarihi

6 Eylül 2022

Yayımlandığı Sayı

Yıl 2022 Cilt: 5 Sayı: 1

Kaynak Göster

APA
Toprak, A., & Turan, M. (2022). Helmholtz-Based Automatic Document Summarization. Veri Bilimi, 5(1), 13-25. https://izlik.org/JA84FG87UG
AMA
1.Toprak A, Turan M. Helmholtz-Based Automatic Document Summarization. Veri Bilim Derg. 2022;5(1):13-25. https://izlik.org/JA84FG87UG
Chicago
Toprak, Ahmet, ve Metin Turan. 2022. “Helmholtz-Based Automatic Document Summarization”. Veri Bilimi 5 (1): 13-25. https://izlik.org/JA84FG87UG.
EndNote
Toprak A, Turan M (01 Ekim 2022) Helmholtz-Based Automatic Document Summarization. Veri Bilimi 5 1 13–25.
IEEE
[1]A. Toprak ve M. Turan, “Helmholtz-Based Automatic Document Summarization”, Veri Bilim Derg, c. 5, sy 1, ss. 13–25, Eki. 2022, [çevrimiçi]. Erişim adresi: https://izlik.org/JA84FG87UG
ISNAD
Toprak, Ahmet - Turan, Metin. “Helmholtz-Based Automatic Document Summarization”. Veri Bilimi 5/1 (01 Ekim 2022): 13-25. https://izlik.org/JA84FG87UG.
JAMA
1.Toprak A, Turan M. Helmholtz-Based Automatic Document Summarization. Veri Bilim Derg. 2022;5:13–25.
MLA
Toprak, Ahmet, ve Metin Turan. “Helmholtz-Based Automatic Document Summarization”. Veri Bilimi, c. 5, sy 1, Ekim 2022, ss. 13-25, https://izlik.org/JA84FG87UG.
Vancouver
1.Ahmet Toprak, Metin Turan. Helmholtz-Based Automatic Document Summarization. Veri Bilim Derg [Internet]. 01 Ekim 2022;5(1):13-25. Erişim adresi: https://izlik.org/JA84FG87UG