Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
489374 | Procedia Computer Science | 2015 | 10 Pages |
Abstract
An algorithm of finding documents on a given topic based on a selected reference collection of documents along with creating context-semantic graph for visualizing themes in search results is presented. The algorithm is based on integration of set of probabilistic, entropic, and semantic markers for extractions of weighted key words and combinations of words, which describe the given topic. Test results demonstrate an average precision of 99% and the recall of 84% on expert selection of documents. Also developed special approach to constructing graph on base of algorithms that extract key phrases with weights. It gives the possibility to demonstrate a structure of subtopics in large collections of documents in compact graph form.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)