MOCS-BERT: Multi-Objective Cuckoo Search with Sentence-BERT Embeddings for Semantically Enhanced Extractive Summarization
Abstract
This study presents MOCS-BERT, an extractive text summarization framework that integrates multi-objective Cuckoo Search Optimization with Sentence-BERT embeddings to generate semantically coherent and readable summaries. Evaluated on the full CNN/DailyMail test set of 11,490 documents, the proposed model is compared against three established metaheuristic baselines: Bat Algorithm (BA), Flower Pollination Algorithm (FPA), and Firefly Algorithm (FA). The Wilcoxon signed-rank test confirms statistically significant superiority over BA and FA (p = 0.0432 and p = 0.0010, respectively), while the difference against FPA is not statistically significant (p = 0.0990). Nevertheless, MOCS-BERT achieves the highest ROUGE-L F1 score (0.1895) among the compared methods, indicating that it preserves narrative coherence and logical structure in the generated summaries. The model also produces readable output, with a Flesch Reading Ease of 66.9 and low grade-level indices, making it accessible to a broad audience. These results indicate that combining deep semantic representations with a multi-objective fitness function that balances semantic relevance, non-redundancy, and readability yields a robust, scalable summarization system. The research advances metaheuristic-based text summarization and offers practical utility in real-world applications, including news aggregation, legal document analysis, and emergency response assistance. Future work will focus on human evaluation, domain-specific adaptation, and automated hyperparameter tuning to further enhance performance and generalizability.
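The multi-objective fitness described in the abstract (semantic relevance, non-redundancy, readability) could be scalarized roughly as follows. This is only a minimal sketch under stated assumptions, not the authors' implementation: the weighted-sum form, the weights `(0.5, 0.3, 0.2)`, the centroid-based relevance score, the vowel-run syllable heuristic, and all function names are hypothetical. Sentence embeddings are assumed to be precomputed (for example by a Sentence-BERT model) and passed in as plain lists of floats.

```python
import math
import re


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def count_syllables(word):
    """Rough heuristic (an assumption): one syllable per run of vowels, minimum one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_reading_ease(text):
    """Flesch Reading Ease: 206.835 - 1.015*(words/sentences) - 84.6*(syllables/words)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        return 0.0
    syllables = sum(count_syllables(w) for w in words)
    return 206.835 - 1.015 * len(words) / len(sentences) - 84.6 * syllables / len(words)


def fitness(selected, sentences, embeddings, weights=(0.5, 0.3, 0.2)):
    """Score a candidate summary given as a list of selected sentence indices.

    The weights and the weighted-sum scalarization are hypothetical choices.
    """
    if not selected:
        return 0.0
    dim = len(embeddings[0])
    # Document centroid: the mean of all sentence embeddings.
    centroid = [sum(e[k] for e in embeddings) / len(embeddings) for k in range(dim)]
    # Relevance: mean similarity of the selected sentences to the centroid.
    relevance = sum(cosine(embeddings[i], centroid) for i in selected) / len(selected)
    # Redundancy: mean pairwise similarity among the selected sentences.
    pairs = [(i, j) for a, i in enumerate(selected) for j in selected[a + 1:]]
    redundancy = sum(cosine(embeddings[i], embeddings[j]) for i, j in pairs) / len(pairs) if pairs else 0.0
    # Readability: Flesch Reading Ease clipped to [0, 100] and rescaled to [0, 1].
    summary = " ".join(sentences[i] for i in selected)
    readability = min(max(flesch_reading_ease(summary), 0.0), 100.0) / 100.0
    w_rel, w_red, w_read = weights
    return w_rel * relevance + w_red * (1.0 - redundancy) + w_read * readability


# Toy data: two identical sentences and one unrelated sentence.
sentences = [
    "The cat sat on the mat.",
    "The cat sat on the mat.",
    "Stocks fell sharply today.",
]
embeddings = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
redundant_score = fitness([0, 1], sentences, embeddings)
diverse_score = fitness([0, 2], sentences, embeddings)
```

With this toy data the non-redundant selection `[0, 2]` scores higher than the duplicate pair `[0, 1]`, which is the behaviour the non-redundancy term is meant to encourage. A search procedure such as Cuckoo Search would then explore candidate index sets and keep those with higher fitness.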
Details
Primary Language
English
Subjects
Computer System Software
Journal Section
Research Article
Authors
Riski Annisa
*
0000-0002-1060-3580
Indonesia
Agung Sasongko
0000-0002-0875-4144
Indonesia
Muhammad Sony Maulana
0000-0001-7254-1234
Indonesia
Early Pub Date
May 3, 2026
Publication Date
June 1, 2026
Submission Date
November 21, 2025
Acceptance Date
March 15, 2026
Published in Issue
Year 2026 Volume: 39 Number: 2