Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10355230 | Information Processing & Management | 2005 | 15 Pages |
Abstract
Most previous information retrieval (IR) models assume that terms of queries and documents are statistically independent from each other. However, conditional independence assumption is obviously and openly understood to be wrong, so we present a new method of incorporating term dependence into a probabilistic retrieval model by adapting a dependency structured indexing system using a dependency parse tree and Chow Expansion to compensate the weakness of the assumption. In this paper, we describe a theoretic process to apply the Chow Expansion to the general probabilistic models and the state-of-the-art 2-Poisson model. Through experiments on document collections in English and Korean, we demonstrate that the incorporation of term dependences using Chow Expansion contributes to the improvement of performance in probabilistic IR systems.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Changki Lee, Gary Geunbae Lee,