Energy-Based Interaction-Aware Graph Modeling for Unsupervised Long-Document Extractive Summarization
Abstract
Long-document extractive summarization presents persistent challenges, including high redundancy, diffuse topical structure, and limited availability of high-quality supervision. Existing unsupervised graph-based methods improve upon classical centrality algorithms but rely on iterative ranking procedures that are sensitive to graph density and lack formal optimality guarantees. This paper introduces a fully unsupervised extractive summarization framework in which sentence salience is formulated as a convex energy minimization problem over a semantic-similarity graph. The objective combines a centrality-driven prior with graph Laplacian regularization, which allows sentence importance to be inferred jointly while preserving convexity and admitting a closed-form solution via a single linear system solve. Discrete sentence selection applies a salience threshold and then a cosine-similarity-based redundancy filter in a separate greedy stage, so the inference step itself remains tractable and globally optimal. Experiments on GovReport, BillSum, and PubMed demonstrate consistent improvements in ROUGE-1 over classical graph-based baselines, indicating improved unigram content coverage. Performance on ROUGE-2 and ROUGE-L is competitive with recent unsupervised approaches on some datasets but shows gaps on others, most notably in ROUGE-L on BillSum, which we attribute to the redundancy filter's effect on local sentence-level coherence. Sensitivity analyses confirm stability across a broad range of graph regularization strengths and sparsity thresholds. These results support convex energy-based modeling as a principled, reproducible, and domain-independent alternative to heuristic iterative ranking for unsupervised long-document summarization, while also identifying local coherence preservation as a direction for future improvement.
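The pipeline outlined in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact objective, so the code below assumes one plausible instance: minimize ||s - c||^2 + lam * s^T L s, where c is a degree-centrality prior and L is the unnormalized graph Laplacian, which gives the linear system (I + lam * L) s = c. The function names, the parameters `lam`, `sparsity`, `tau`, and `max_sim`, and the choice of prior are illustrative assumptions, not the paper's specification.

```python
import numpy as np

def salience_scores(emb, lam=1.0, sparsity=0.2):
    """Hypothetical sketch: solve (I + lam * L) s = c for sentence salience s.

    emb      : (n, d) array of non-zero sentence embeddings (any encoder).
    lam      : assumed Laplacian regularization strength (>= 0).
    sparsity : assumed similarity cut-off (>= 0) used to sparsify the graph.
    """
    # Cosine similarity between unit-normalized sentence embeddings.
    unit = emb / np.linalg.norm(emb, axis=1, keepdims=True)
    sim = unit @ unit.T
    # Keep only edges at or above the cut-off and remove self-loops.
    W = np.where(sim >= sparsity, sim, 0.0)
    np.fill_diagonal(W, 0.0)
    deg = W.sum(axis=1)
    L = np.diag(deg) - W  # unnormalized graph Laplacian (positive semidefinite)
    # Assumed prior: degree centrality scaled to sum to 1 (uniform if no edges).
    n = len(deg)
    c = deg / deg.sum() if deg.sum() > 0 else np.full(n, 1.0 / n)
    # Setting the gradient 2(s - c) + 2 * lam * L s to zero gives this system.
    # I + lam * L is symmetric positive definite, so the solution is unique.
    s = np.linalg.solve(np.eye(n) + lam * L, c)
    return s, unit

def select_sentences(s, unit, tau=None, max_sim=0.8):
    """Threshold on salience, then greedily drop near-duplicate sentences."""
    if tau is None:
        tau = s.mean()  # assumed default threshold: mean salience
    chosen = []
    for i in np.argsort(-s):  # visit sentences from most to least salient
        if s[i] < tau:
            break
        # Keep a sentence only if it is not too similar to any kept one.
        if all(unit[i] @ unit[j] < max_sim for j in chosen):
            chosen.append(int(i))
    return sorted(chosen)  # restore document order
```

In this sketch, inference is a single call to `np.linalg.solve`, and the discrete selection runs afterwards as a separate greedy pass, mirroring the two-stage separation the abstract describes. Because the Laplacian's rows sum to zero, the scores keep the same total as the prior, so the regularizer only redistributes salience between neighboring sentences.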
Details
Primary Language
English
Subjects
Natural Language Processing
Journal Section
Research Article
Authors
Abubakar Bashir
*
0000-0002-4346-6826
Nigeria
Early Pub Date
June 19, 2026
Publication Date
June 30, 2026
Submission Date
January 1, 2026
Acceptance Date
April 25, 2026
Published in Issue
Year 2026 Volume: 9 Number: 3
