کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515744 867088 2008 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Single-document and multi-document summarization techniques for email threads using sentence compression
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Single-document and multi-document summarization techniques for email threads using sentence compression
چکیده انگلیسی

We present two approaches to email thread summarization: collective message summarization (CMS) applies a multi-document summarization approach, while individual message summarization (IMS) treats the problem as a sequence of single-document summarization tasks. Both approaches are implemented in our general framework driven by sentence compression. Instead of a purely extractive approach, we employ linguistic and statistical methods to generate multiple compressions, and then select from those candidates to produce a final summary. We demonstrate these ideas on the Enron email collection – a very challenging corpus because of the highly technical language. Experimental results point to two findings: that CMS represents a better approach to email thread summarization, and that current sentence compression techniques do not improve summarization performance in this genre.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 44, Issue 4, July 2008, Pages 1600–1610
نویسندگان
, , ,