کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
496304 862856 2012 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Cross-document structural relationship identification using supervised machine learning
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Cross-document structural relationship identification using supervised machine learning
چکیده انگلیسی

Multi document analysis has been a field of interest for decades and is still being actively researched until today. One example of such analysis could be for the task of multi document summarization which is meant to represent the concise description of the original documents. In this paper, we will focus on some special properties that multi document articles hold, specifically news articles. Information across news articles reporting on the same story are often related. Cross-document structure theory (CST) gives several relationships between pairs of sentences from different documents. Among them, we focus on four relations namely “Identity”, “Overlap”, “Subsumption”, and “Description”. Our aim is to automatically identify these CST relationships. We applied three machine learning techniques, i.e. SVM, neural network and our proposed case-based reasoning (CBR) model. Comparison between these techniques shows that the proposed CBR model yields better results.

Figure optionsDownload as PowerPoint slideHighlights
► Supervised machine learning for CST relationship identification.
► Identify four relationship types namely “Identity”, “Overlap”, “Subsumption”, and “Description”.
► Comparing SVM, NN and proposed CBR model.
► Overall CBR obtains better accuracy than SVM and NN.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 12, Issue 10, October 2012, Pages 3124–3131
نویسندگان
, , ,