کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
383119 660802 2016 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
How to Improve Text Summarization and Classification by Mutual Cooperation on an Integrated Framework
ترجمه فارسی عنوان
چگونگی بهبود تلخیص و طبقه بندی متن با همکاری متقابل در یک چارچوب یکپارچه
کلمات کلیدی
خلاصه متن؛ طبقه بندی متن؛ مدل مستقل باینری؛ مدل زبانی مبتنی بر خوشه ؛ ماشین بردار پشتیبان
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• An effective integrated framework using both of summary and category information.
• The summarization technique utilizes the category information from classification.
• The classification technique utilizes the summary information from summarization.
• This integrated framework achieves significant improvement.

Text summarization and classification are core techniques to analyze a huge amount of text data in the big data environment. Moreover, as the need to read texts on smart phones, tablets and television as well as personal computers continues to grow, text summarization and classification techniques become more important and both of them do essential processes for text analysis in many applications.Traditional text summarization and classification techniques have individually been considered as different research fields in this literature. However, we find out that they can help each other as text summarization makes use of category information from text classification and text classification does summary information from text summarization. Therefore, we propose an effective integrated learning framework using both of summary and category information in this paper. In this framework, the feature-weighting method for text summarization utilizes a language model to combine feature distributions in each category and text, and one for text classification does the sentence importance scores estimated from the text summarization.In the experiments, the performances of the integrated framework are better than ones of individual text summarization and classification. In addition, the framework has some advantages of easy implementation and language independence because it is based on only simple statistical approaches and POS tagger.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 60, 30 October 2016, Pages 222–233
نویسندگان
, , ,