| Article ID | Journal | Published Year | Pages | File Type |
|---|---|---|---|---|
| 10355236 | Information Processing & Management | 2005 | 22 Pages | |
Abstract
Vector space, probability, and Okapi BM25 ranking are extended to include structure weighting. Weights are selected for the TREC WSJ collection using a genetic algorithm, and the learned weights are then tested on an evaluation set of queries. Structure-weighted vector space inner product and structure-weighted probabilistic retrieval each show an improvement of about 5% in mean average precision over their unstructured counterparts. Structure-weighted BM25 shows almost no improvement. Analysis suggests BM25 cannot be improved using structure weighting.
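The abstract does not give the scoring formulas, so the following is only a minimal sketch of one possible reading of a structure-weighted inner product: each term occurrence contributes the weight of the structure it appears in rather than a flat 1. The function name `structure_weighted_score`, the field names `headline` and `body`, and the weight values are illustrative assumptions, not taken from the paper.

```python
from collections import Counter


def structure_weighted_score(query_terms, doc_fields, weights):
    """Inner product in which an occurrence in a field counts weights[field].

    Assumption: fields missing from `weights` default to 1.0, which
    reduces the score to the plain (unstructured) inner product.
    """
    query_tf = Counter(query_terms)
    score = 0.0
    for field, field_terms in doc_fields.items():
        w = weights.get(field, 1.0)
        field_tf = Counter(field_terms)
        for term, q in query_tf.items():
            # Counter returns 0 for terms absent from the field.
            score += q * w * field_tf[term]
    return score


# Hypothetical document split into two structures.
doc = {
    "headline": ["bank", "merger"],
    "body": ["the", "bank", "announced", "a", "merger", "with", "another", "bank"],
}
query = ["bank", "merger"]

unstructured = structure_weighted_score(query, doc, {})
weighted = structure_weighted_score(query, doc, {"headline": 2.0, "body": 1.0})
```

In a setup like the one the abstract describes, the weight dictionary would be what a search procedure such as a genetic algorithm tunes against training queries; the values above are hand-picked purely for illustration.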
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Andrew Trotman
