Article code: 865553
Journal code: 909674
Publication year: 2009
English article: 12-page PDF
Full-text version: Free download
English title of the ISI article
Keyword Searches in Data-Centric XML Documents Using Tree Partitioning
Related topics
Engineering and Basic Sciences, Other Engineering Fields, Engineering (General)
English abstract
This paper presents an effective keyword search method for data-centric Extensible Markup Language (XML) documents. The method divides an XML document into compact, connected, integral subtrees, called self-integral trees (SI-Trees), to capture the structural information in the document. The SI-Trees are generated from a schema guide. Meaningful self-integral trees (MSI-Trees), which contain all or some of the input keywords, are then identified as candidate answers. An index accelerates the retrieval of the MSI-Trees related to the input keywords, and the MSI-Trees are ranked so that the top-k results can be returned. Extensive tests demonstrate that the method takes 10-100 ms to answer a keyword query and outperforms existing approaches by 1-2 orders of magnitude.
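The pipeline described in the abstract (partition, index, rank) can be illustrated with a toy sketch. This is not the paper's algorithm: the `ENTITY_TAGS` set is a hypothetical stand-in for the schema guide, the sample document is invented, and ranking by the number of matched keywords is an assumed scoring rule chosen only for illustration.

```python
import heapq
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical stand-in for the schema guide: tags whose subtrees
# are treated as self-contained units (a toy analogue of SI-Trees).
ENTITY_TAGS = {"paper"}

def partition(root, entity_tags=ENTITY_TAGS):
    """Return the subtrees rooted at entity tags, in document order."""
    return [el for el in root.iter() if el.tag in entity_tags]

def build_index(subtrees):
    """Inverted index: lowercase token -> set of subtree positions."""
    index = defaultdict(set)
    for i, tree in enumerate(subtrees):
        for text in tree.itertext():
            for token in text.lower().split():
                index[token].add(i)
    return index

def search(index, subtrees, keywords, k=2):
    """Return up to k subtrees matching all or some keywords.

    Assumed scoring: number of distinct query keywords matched;
    ties are broken in favour of earlier document order.
    """
    hits = defaultdict(int)
    for kw in {w.lower() for w in keywords}:
        for i in index.get(kw, ()):
            hits[i] += 1
    top = heapq.nlargest(k, hits.items(), key=lambda item: (item[1], -item[0]))
    return [(subtrees[i], score) for i, score in top]

# Invented sample document, for illustration only.
DOC = """<dblp>
  <paper><title>XML keyword search</title><author>Alice</author></paper>
  <paper><title>Tree partitioning</title><author>Bob</author></paper>
  <paper><title>Keyword search on graphs</title><author>Carol</author></paper>
</dblp>"""

root = ET.fromstring(DOC)
trees = partition(root)
index = build_index(trees)
results = search(index, trees, ["XML", "keyword"], k=2)
titles = [tree.findtext("title") for tree, _ in results]
```

Here the first `paper` subtree matches both keywords and the third matches one, so they are returned in that order, while the second is never touched because the index lookup skips subtrees containing no query keyword.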
Publisher
Database: Elsevier - ScienceDirect
Journal: Tsinghua Science & Technology - Volume 14, Issue 1, February 2009, Pages 7-18