Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
515703 | Information Processing & Management | 2010 | 18 Pages |
Abstract
The paper presents methods of retrieving blog posts containing opinions about an entity expressed in the query. The methods use a lexicon of subjective words and phrases compiled from manually and automatically developed resources. One of the methods uses the Kullback–Leibler divergence to weight subjective words occurring near query terms in documents, another uses proximity between the occurrences of query terms and subjective words in documents, and the third combines both factors. Methods of structuring queries into facets, facet expansion using Wikipedia, and a facet-based retrieval are also investigated in this work. The methods were evaluated using the TREC 2007 and 2008 Blog track topics, and proved to be highly effective.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Olga Vechtomova,