Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
515221 | Information Processing & Management | 2006 | 9 Pages |
Abstract
Contextual document clustering is a novel approach which uses information theoretic measures to cluster semantically related documents bound together by an implicit set of concepts or themes of narrow specificity. It facilitates cluster-based retrieval by assessing the similarity between a query and the cluster themes’ probability distribution. In this paper, we assess a relevance feedback mechanism, based on query refinement, that modifies the query’s probability distribution using a small number of documents that have been judged relevant to the query. We demonstrate that by providing only one relevance judgment, a performance improvement of 33% was obtained.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Niall Rooney, David Patterson, Mykola Galushka, Vladimir Dobrynin,