Node-Weighted Centrality Ranking for Unsupervised Long Document Summarization

Tuba Gokhan*, Phillip Smith, Mark Lee

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Supervised methods have demonstrated superior performance to unsupervised methods in text summarization. However, supervised methods heavily rely on human-generated summaries, which can be costly and difficult to obtain in large quantities. They also face challenges in summarizing long documents due to input length restrictions. Graph-based methods are frequently employed in unsupervised text summarization owing to their capacity to examine interrelationships between. However, these methods usually depend on unique node weights, resulting in limited mapping capabilities and weak performance on long documents. To address these difficulties, this study proposes an unsupervised method that employs a graph model with augmented node weights with a novel centrality ranking algorithm. Comprehensive experiments on standard datasets demonstrate the effectiveness of the proposed method, which outperforms both unsupervised and supervised techniques when evaluated using the ROUGE metric.
Original languageEnglish
Title of host publicationNatural Language Processing and Information Systems
Subtitle of host publication28th International Conference on Applications of Natural Language to Information Systems, NLDB 2023, Derby, UK, June 21–23, 2023, Proceedings
EditorsElisabeth Métais, Farid Meziane, Vijayan Sugumaran, Warren Manning, Stephan Reiff-Marganiec
Place of PublicationCham
PublisherSpringer
Pages299–312
Number of pages14
Edition1
ISBN (Electronic)9783031353208
ISBN (Print)9783031353192
DOIs
Publication statusPublished - 14 Jun 2023
Event28th International Conference on Applications of Natural Language to Information Systems - Derby, United Kingdom
Duration: 21 Jun 202323 Jun 2023

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume13913
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference28th International Conference on Applications of Natural Language to Information Systems
Abbreviated titleNLDB 2023
Country/TerritoryUnited Kingdom
CityDerby
Period21/06/2323/06/23

Bibliographical note

Acknowledgments:
The first author would like to acknowledge the Ministry of National Education of Turkey for the financial support of her research activity.

Keywords

  • SentenceBERT
  • Ranking
  • Sentence Centrality
  • Unsupervised
  • Latent Semantic Analysis
  • Sentence Feature Scoring

Fingerprint

Dive into the research topics of 'Node-Weighted Centrality Ranking for Unsupervised Long Document Summarization'. Together they form a unique fingerprint.

Cite this