SDbQfSum: Query-focused summarization framework basedon diversity and text semantic analysis

Abstract

Query-focused multi-document summarization (Qf-MDS) is a sub-task of automatic text summarization that aims to extract a substitute summary from a document cluster of the same topic and based on a user query. Unlike other summarization tasks, Qf-MDS has specific research challenges including the differences and similarities across related document sets, the high degree of redundancy inherent in the summaries created from multiple related sources, relevance to the given query, topic diversity in the produced summary and the small source-to-summary compression ratio. In this work, we propose a semantic diversity feature based query-focused extractive summarizer (SDbQfSum) built on powerful text semantic representation techniques underpinned with Wikipedia commonsense knowledge in order to address the query-relevance, centrality, redundancy and diversity challenges. Specifically, a semantically parsed document text is combined with knowledge-based vectorial representation to extract effective sentence importance and query-relevance features. The proposed monolingual summarizer is evaluated on a standard English dataset for automatic query-focused summarization tasks, that is, the DUC2006 dataset. The obtained results show that our summarizer outperforms most state-of-the-art related approaches on one or more ROUGE measures achieving 0.418, 0.092 and 0.152 in ROUGE-1, ROUGE-2,and ROUGE-SU4 respectively. It also attains competitive performance with the slightly outperforming system(s), for example, the difference between our system's result and best system in ROUGE-1 is just 0.006. We also found through the conducted experiments that our proposed custom cluster merging algorithm significantly reduces information redundancy while maintaining topic diversity across documents.

Publication DOI: https://doi.org/10.1111/exsy.13462
Divisions: College of Business and Social Sciences > Aston Business School > Operations & Information Management
College of Business and Social Sciences > Aston Business School
Additional Information: © 2023 The Authors. Expert Systems published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Uncontrolled Keywords: query-focused summarization,query-relevance,semantic role labeling,sentence centrality,sentence similarity,Artificial Intelligence,Theoretical Computer Science,Control and Systems Engineering,Computational Theory and Mathematics
Publication ISSN: 1468-0394
Last Modified: 29 Apr 2024 07:43
Date Deposited: 02 Oct 2023 13:17
Full Text Link:
Related URLs: https://onlinel ... 1111/exsy.13462 (Publisher URL)
http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2024-01
Published Online Date: 2023-09-29
Accepted Date: 2023-09-15
Authors: Mohamed, Muhidin
Oussalah, Mourad
Chang, Victor (ORCID Profile 0000-0002-8012-5852)

Download

[img]

Version: Published Version

License: Creative Commons Attribution

| Preview

Export / Share Citation


Statistics

Additional statistics for this record