Article ID: 515048
Journal: Information Processing & Management
Published Year: 2011
Pages: 15
File Type: PDF
Abstract

In a hierarchical XML structure, the surrounding elements form the context of an XML element. In document-oriented XML, this context is part of the semantics of the element and augments its textual information. Taking the context of an element into account in element scoring is called contextualization. This study extends the concept of contextualization and presents a classification of contextualization models. In an XML collection, elements are of different granularity, i.e. lower level elements are shorter and carry less textual information. It therefore seems plausible that contextualization affects elements of different granularity differently. Although contextualization is known to improve effectiveness in element retrieval, the improvement at different granularity levels has not been investigated. This study explores the effect of contextualization on these levels. Further, a parameterized framework for testing contextualization is presented. The empirical part of the study is carried out in a traditional laboratory setting, where an XML collection is granulated. This is necessary in order to measure performance separately at different hierarchy levels. The results confirm the effectiveness of contextualization and show how elements of different granularities benefit from it.

► The concept of contextualization is extended.
► A classification of contextualization models is defined.
► A more general contextualization function is developed.
► Vertical and horizontal contextualization are tested in a tailored test setting.
► Contextualization is found to be beneficial at different element granularities.
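
As an illustration only, the idea of contextualized element scoring can be sketched in Python. This is not the contextualization function developed in the study. The `Element` class, the `weight` parameter, the plain averaging of context scores, and the reading of vertical context as ancestors and horizontal context as siblings are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Element:
    """Hypothetical XML element with a precomputed, context-free score."""
    name: str
    score: float
    parent: Optional["Element"] = None
    children: List["Element"] = field(default_factory=list)

    def add(self, child: "Element") -> "Element":
        """Attach a child element and return it."""
        child.parent = self
        self.children.append(child)
        return child


def ancestors(element: Element) -> List[Element]:
    """Assumed vertical context: all elements on the path to the root."""
    found = []
    node = element.parent
    while node is not None:
        found.append(node)
        node = node.parent
    return found


def siblings(element: Element) -> List[Element]:
    """Assumed horizontal context: the other children of the same parent."""
    if element.parent is None:
        return []
    return [c for c in element.parent.children if c is not element]


def contextualized_score(element: Element, context: List[Element],
                         weight: float = 0.5) -> float:
    """Blend the element's own score with the mean score of its context.

    weight = 0 ignores the context; weight = 1 uses the context alone.
    The linear blend is an assumption of this sketch.
    """
    if not context:
        return element.score
    context_score = sum(c.score for c in context) / len(context)
    return (1 - weight) * element.score + weight * context_score


# Toy document: article > section > two paragraphs (scores are made up).
article = Element("article", 0.4)
section = article.add(Element("section", 0.6))
p1 = section.add(Element("p", 0.2))
p2 = section.add(Element("p", 0.8))

vertical = contextualized_score(p1, ancestors(p1), weight=0.5)
horizontal = contextualized_score(p1, siblings(p1), weight=0.5)
```

In this toy tree, the short paragraph `p1` has a low score of its own, and either kind of context raises it. This is the sort of effect the abstract suggests may differ across granularity levels.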

Related Topics
Physical Sciences and Engineering > Computer Science > Computer Science Applications