کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515856 867120 2014 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Text summarization using Wikipedia
ترجمه فارسی عنوان
خلاصه متن با استفاده از ویکی پدیا
کلمات کلیدی
خلاصه سازی، ویکیپدیا، رتبه بندی جمله، شخصی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Summarization using Wikipedia and graph-based ranking.
• Theoretical analysis for various sentence–concept models.
• Real-time incremental summarization.
• Performance analysis using the ROUGE metric and user evaluations.
• Personalized and query-focused summarization, multi-document summarization.

Automatic text summarization has been an active field of research for many years. Several approaches have been proposed, ranging from simple position and word-frequency methods, to learning and graph based algorithms. The advent of human-generated knowledge bases like Wikipedia offer a further possibility in text summarization – they can be used to understand the input text in terms of salient concepts from the knowledge base. In this paper, we study a novel approach that leverages Wikipedia in conjunction with graph-based ranking. Our approach is to first construct a bipartite sentence–concept graph, and then rank the input sentences using iterative updates on this graph. We consider several models for the bipartite graph, and derive convergence properties under each model. Then, we take up personalized and query-focused summarization, where the sentence ranks additionally depend on user interests and queries, respectively. Finally, we present a Wikipedia-based multi-document summarization algorithm. An important feature of the proposed algorithms is that they enable real-time incremental summarization – users can first view an initial summary, and then request additional content if interested. We evaluate the performance of our proposed summarizer using the ROUGE metric, and the results show that leveraging Wikipedia can significantly improve summary quality. We also present results from a user study, which suggests that using incremental summarization can help in better understanding news articles.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 50, Issue 3, May 2014, Pages 443–461
نویسندگان
, , ,