Abstract
In the extractive multiple documents summarization (MDS) field, most existing works input all related documents simultaneously to generate the summary. As a result, they usually suffer from the domination of one document (because of length or other factors) which jeopardizes the completeness of the summary. We propose a novel neural network model, Self-Attention with Non-Maximum Suppression (SA-NMS), inspired from the perspectives of human reading habits and object detection to solve the problem. Our model trains a self-attentive neural network at the single document level to get a sentence prominence score in its document, treating each document with the same weight to avoid the domination of one document. Then the NMS, an algorithm widely used in object recognition but first adopted in the field of NLP, is implemented as a strong filter to reduce redundancy. The experimental results show that our SA-NMS model not only is easy to train but also performs competitively on the small DUC-2004 dataset (50 pairs in total) and outperforms the extractive approaches on ROUGE scores on the large-scale Multi-News dataset.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Conference on Internet of Things, Communication and Intelligent Technology |
Editors | Jian Dong, Long Zhang |
Publisher | Springer |
Pages | 553-563 |
Number of pages | 11 |
Edition | 1st |
ISBN (Electronic) | 9789819904167 |
ISBN (Print) | 9789819904150, 9789819904181 |
DOIs | |
Publication status | Published - 23 Apr 2023 |
Event | International Conference on Internet of Things, Communication and Intelligent Technology, IoTCIT 2022 - Changsha, China Duration: 22 Aug 2022 → 24 Aug 2022 https://www.iotcit.org/home-2022/ https://link.springer.com/book/10.1007/978-981-99-0416-7 |
Publication series
Name | Lecture Notes in Electrical Engineering |
---|---|
Volume | 1015 LNEE |
ISSN (Print) | 1876-1100 |
ISSN (Electronic) | 1876-1119 |
Name | IoTCIT: International Conference on Internet of Things, Communication and Intelligent Technology |
---|
Conference
Conference | International Conference on Internet of Things, Communication and Intelligent Technology, IoTCIT 2022 |
---|---|
Country/Territory | China |
City | Changsha |
Period | 22/08/22 → 24/08/22 |
Internet address |
Scopus Subject Areas
- Industrial and Manufacturing Engineering
User-Defined Keywords
- Attention
- Document domination
- Multiple document summarization
- Non-maximum suppression