Research Article

MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization

Volume: 39 Number: 2 June 1, 2026
EN

MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization

Abstract

This study presents MOCS-BERT, a novel extractive text summarization framework that effectively integrates multi-objective Cuckoo Search Optimization with Sentence-BERT embeddings to generate semantically coherent and readable summaries. Evaluated on the full CNN/DailyMail test set comprising 11,490 documents, the proposed model demonstrates statistically significant superiority over three established metaheuristic algorithms namely Bat Algorithm (BA), Flower Pollination Algorithm (FPA), and Firefly Algorithm (FA) as confirmed by the Wilcoxon signed-rank test (p = 0.0432 and p = 0.0010, respectively). Although it does not show statistical significance against Flower Pollination Algorithm (p = 0.0990) among the three baseline metaheuristic algorithms evaluated (BA, FPA, FA), MOCS-BERT consistently achieves the highest ROUGE-L F1 score (0.1895), underscoring its exceptional ability to preserve narrative coherence and logical structure in generated summaries. Furthermore, the model produces highly readable output, with a Flesch Reading Ease of 66.9 and low grade-level indices, making it accessible to a broad audience. These results validate that the integration of deep semantic representations with a carefully designed multi-objective fitness function balancing semantic relevance, non-redundancy, and readability yields a robust, scalable summarization system with balanced performance across semantic coherence, redundancy control, and readability metrics. The research not only progresses the methodological boundaries of metaheuristic-based text summarization but also provides practical utility in real-world applications, including news aggregation, legal document analysis, and emergency response assistance.. Future work will focus on human evaluation, domain-specific adaptation, and automated hyperparameter tuning to further enhance performance and generalizability. 

Keywords

Supporting Institution

This research was supported by the Department of Computer Science and Engineering, Bina Sarana Informatika University, Indonesia. The computational experiments were conducted using Google Colab, a cloud-based platform for machine learning research.

Ethical Statement

This study does not involve human or animal participants, and therefore no ethical approval was required. All data used in this research are publicly available through the Hugging Face Datasets library (https://huggingface.co/datasets/cnn_dailymail ), which provides open-access news articles and their reference summaries under non-commercial licenses. The authors confirm that this work adheres to the highest standards of academic integrity and ethical conduct.

Thanks

The authors would like to thank the developers of the Hugging Face Transformers and Sentence-Transformers libraries for providing state-of-the-art NLP tools. We also acknowledge the support of the open-source community behind niapy, which enabled the implementation of nature-inspired optimization algorithms. Special thanks to the anonymous reviewers for their valuable feedback that helped improve the quality of this paper.

References

  1. [1] Luo, M., Xue, B. and Niu, B., “A comprehensive survey for automatic text summarization: techniques, approaches and perspectives”, Neurocomputing, 603: 128280, (2024). DOI: https://doi.org/10.1016/j.neucom.2024.128280
  2. [2] Peyrard, M., “A simple theoretical model of importance for summarization”, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 1059–1073, (2019). DOI: 10.18653/v1/P19-1101
  3. [3] Devlin, J., Chang, M.-W., Lee, K. and Toutanova, K., “BERT: Pre-training of deep bidirectional transformers for language understanding”, Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), 4171–4186, (2019). DOI: 10.18653/v1/N19-1423
  4. [4] Sharma, G. and Sharma, D., “Automatic text summarization methods: a comprehensive review”, SN Computer Science, 4(1): 33, (2022). DOI: https://doi.org/10.1007/s42979-022-01446-w
  5. [5] Liu, Y. and Lapata, M., “Text summarization with pretrained encoders”, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 3721–3731, (2019). DOI: https://doi.org/10.48550/arXiv.1908.08345
  6. [6] Liu, Y., “Fine-tune BERT for extractive summarization”, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 3808–3818, (2019). DOI: https://doi.org/10.48550/arXiv.1903.10318
  7. [7] See, A., Liu, P. J. and Manning, C. D., “Get to the point: summarization with pointer-generator networks”, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 1073–1083, (2017). DOI: https://doi.org/10.48550/arXiv.1704.04368
  8. [8] Zhou, Q., Yang, N., Wei, F., Huang, S., Zhou, M. and Zhao, T., “Neural document summarization by jointly learning to score and select sentences”, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 654–663, (2018). DOI: 10.18653/v1/P18-1061

Details

Primary Language

English

Subjects

Computer System Software

Journal Section

Research Article

Early Pub Date

May 3, 2026

Publication Date

June 1, 2026

Submission Date

November 21, 2025

Acceptance Date

March 15, 2026

Published in Issue

Year 2026 Volume: 39 Number: 2

APA
Annisa, R., Sasongko, A., & Maulana, M. S. (2026). MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization. Gazi University Journal of Science, 39(2), 957-972. https://doi.org/10.35378/gujs.1827571
AMA
1.Annisa R, Sasongko A, Maulana MS. MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization. Gazi University Journal of Science. 2026;39(2):957-972. doi:10.35378/gujs.1827571
Chicago
Annisa, Riski, Agung Sasongko, and Muhammad Sony Maulana. 2026. “MOCS-BERT: Multi-Objective Cuckoo Search With Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization”. Gazi University Journal of Science 39 (2): 957-72. https://doi.org/10.35378/gujs.1827571.
EndNote
Annisa R, Sasongko A, Maulana MS (June 1, 2026) MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization. Gazi University Journal of Science 39 2 957–972.
IEEE
[1]R. Annisa, A. Sasongko, and M. S. Maulana, “MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization”, Gazi University Journal of Science, vol. 39, no. 2, pp. 957–972, June 2026, doi: 10.35378/gujs.1827571.
ISNAD
Annisa, Riski - Sasongko, Agung - Maulana, Muhammad Sony. “MOCS-BERT: Multi-Objective Cuckoo Search With Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization”. Gazi University Journal of Science 39/2 (June 1, 2026): 957-972. https://doi.org/10.35378/gujs.1827571.
JAMA
1.Annisa R, Sasongko A, Maulana MS. MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization. Gazi University Journal of Science. 2026;39:957–972.
MLA
Annisa, Riski, et al. “MOCS-BERT: Multi-Objective Cuckoo Search With Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization”. Gazi University Journal of Science, vol. 39, no. 2, June 2026, pp. 957-72, doi:10.35378/gujs.1827571.
Vancouver
1.Riski Annisa, Agung Sasongko, Muhammad Sony Maulana. MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization. Gazi University Journal of Science. 2026 Jun. 1;39(2):957-72. doi:10.35378/gujs.1827571