کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
387048 | 660895 | 2013 | 9 صفحه PDF | دانلود رایگان |

• A novel general-purpose multi-document summarizer is proposed.
• The proposed summarizer relies on an established ontology-based model.
• The summarization process tightly integrates a semantics-based text evaluation.
• The summarizer effectiveness has been validated on benchmark documents.
Sentence-based multi-document summarization is the task of generating a succinct summary of a document collection, which consists of the most salient document sentences. In recent years, the increasing availability of semantics-based models (e.g., ontologies and taxonomies) has prompted researchers to investigate their usefulness for improving summarizer performance. However, semantics-based document analysis is often applied as a preprocessing step, rather than integrating the discovered knowledge into the summarization process.This paper proposes a novel summarizer, namely Yago-based Summarizer, that relies on an ontology-based evaluation and selection of the document sentences. To capture the actual meaning and context of the document sentences and generate sound document summaries, an established entity recognition and disambiguation step based on the Yago ontology is integrated into the summarization process.The experimental results, which were achieved on the DUC’04 benchmark collections, demonstrate the effectiveness of the proposed approach compared to a large number of competitors as well as the qualitative soundness of the generated summaries.
Journal: Expert Systems with Applications - Volume 40, Issue 17, 1 December 2013, Pages 6976–6984