کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
400819 1438984 2014 22 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
HAUSS: Incrementally building a summarizer combining multiple techniques
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
HAUSS: Incrementally building a summarizer combining multiple techniques
چکیده انگلیسی


• We combine different base techniques for automatic summarization of legal text.
• The techniques are combined in a context-specific way with rules.
• Our knowledge acquisition framework supports the creation of such rules.
• We improve coverage of the knowledge base by re-combining existing rule conditions.
• Our system outperforms machine learning and automatic summarization tools.

The idea of automatic summarization dates back to 1958, when Luhn invented the “auto abstract” (Luhn, 1958). Since then, many diverse automatic summarization approaches have been proposed, but no single technique has solved the increasingly urgent need for automatic summarization. Rather than proposing one more such technique, we suggest that the best solution is likely a system able to combine multiple summarization techniques, as required by the type of documents being summarized. Thus, this paper presents HAUSS: a framework to quickly build specialized summarizers, integrating several base techniques into a single approach. To recognize relevant text fragments, rules are created that combine frequency, centrality, citation and linguistic information in a context-dependent way. An incremental knowledge acquisition framework strongly supports the creation of these rules, using a training corpus to guide rule acquisition, and produce a powerful knowledge base specific to the domain. Using HAUSS, we created a knowledge base for catchphrase extraction in legal text. The system outperforms existing state-of-the-art general-purpose summarizers and machine learning approaches. Legal experts rated the extracted summaries similar to the original catchphrases given by the court. Our investigation of knowledge acquisition methods for summarization therefore demonstrates that it is possible to quickly create effective special-purpose summarizers, which combine multiple techniques, into a single context-aware approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: International Journal of Human-Computer Studies - Volume 72, Issue 7, July 2014, Pages 584–605
نویسندگان
, , ,