Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization


Wanying Yan, Junjun Guo, Journal of Information Processing Systems Vol. 16, No. 4, pp. 820-831, Aug. 2020  

10.3745/JIPS.04.0181
Keywords: Extractive Summarization, Hierarchical Selective Encoding, Redundant Information Clipping
Fulltext:

Abstract

Extractive document summarization aims to select a few sentences while preserving its main information on a given document, but the current extractive methods do not consider the sentence-information repeat problem especially for news document summarization. In view of the importance and redundancy of news text information, in this paper, we propose a neural extractive summarization approach with joint sentence semantic clipping and selection, which can effectively solve the problem of news text summary sentence repetition. Specifically, a hierarchical selective encoding network is constructed for both sentence-level and documentlevel document representations, and data containing important information is extracted on news text; a sentence extractor strategy is then adopted for joint scoring and redundant information clipping. This way, our model strikes a balance between important information extraction and redundant information filtering. Experimental results on both CNN/Daily Mail dataset and Court Public Opinion News dataset we built are presented to show the effectiveness of our proposed approach in terms of ROUGE metrics, especially for redundant information filtering.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.




Cite this article
[APA Style]
Wanying Yan and Junjun Guo (2020). Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization. Journal of Information Processing Systems, 16(4), 820-831. DOI: 10.3745/JIPS.04.0181.

[IEEE Style]
W. Yan and J. Guo, "Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization," Journal of Information Processing Systems, vol. 16, no. 4, pp. 820-831, 2020. DOI: 10.3745/JIPS.04.0181.

[ACM Style]
Wanying Yan and Junjun Guo. 2020. Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization. Journal of Information Processing Systems, 16, 4, (2020), 820-831. DOI: 10.3745/JIPS.04.0181.