کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6857937 664775 2014 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization
ترجمه فارسی عنوان
افزایش تنوع و پوشش خلاصه های اسناد از طریق خوشه بندی زیربخش و بهینه سازی مبتنی بر خوشه بندی
کلمات کلیدی
خلاصه سازی سند، تنوع اطلاعات، پوشش اطلاعات خوشه بندی فضای مجاز،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Sentence clustering has been successfully applied in document summarization to discover the topics conveyed in a collection of documents. However, existing clustering-based summarization approaches are seldom targeted for both diversity and coverage of summaries, which are believed to be the two key issues to determine the quality of summaries. The focus of this work is to explore a systematic approach that allows diversity and coverage to be tackled within an integrated clustering-based summarization framework. Given the fact that normally each topic can be described by a set of keywords and the choice of the keywords among the topics is topic-dependent, we take the advantage of the newly emerged subspace clustering to enable the flexibility of keyword selection and the improved quality of sentence clustering. On this basis, we develop two clustering-based optimization strategies, namely local optimization and global optimization to pursue our targets. Experimental results on the DUC datasets demonstrate effectiveness and robustness of the proposed approach.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 279, 20 September 2014, Pages 764-775
نویسندگان
, , ,