Article code: 1123316
Journal code: 1488532
Publication year: 2011
English article, full text: 7-page PDF
English title of the ISI article
Content-Structure Correspondence: A Generic Representation for Heterogeneous Structured Document
Related topics
Social Sciences and Humanities; Arts and Humanities; Arts and Humanities (General)
English abstract

On the web, most structured document collections consist of documents that come from different sources and are marked up with different types of structures. This diversity of structures has led to the emergence of heterogeneous structured documents, which pose new challenges for document representation in structured document retrieval. The representation model needs to handle various types of structures as well as multiple structures in a single document. Furthermore, the same information may be represented in different structures, and the information contained in different documents may be partial and inconsistent. Therefore, the linkage of semantically related elements in the document collections needs to be modelled in the representation model. In this paper, we introduce a generic and flexible structured document model to represent heterogeneous structured documents as well as the correspondences between similar elements in the document collections.

Publisher
Database: Elsevier - ScienceDirect
Journal: Procedia - Social and Behavioral Sciences - Volume 27, 2011, Pages 226-232