کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
396560 670386 2012 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
pest: Fast approximate keyword search in semantic data using eigenvector-based term propagation
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
pest: Fast approximate keyword search in semantic data using eigenvector-based term propagation
چکیده انگلیسی

We present pest, a novel approach to the approximate querying of graph-structured data such as RDF that exploits the data's structure to propagate term weights between related data items. We focus on data where meaningful answers are given through the application semantics, e.g., pages in wikis, persons in social networks, or papers in a research network such as Mendeley. The pest matrix generalizes the Google Matrix used in PageRank with a term-weight dependent leap and accommodates different levels of (semantic) closeness for different relations in the data, e.g., friend vs. co-worker in a social network. Its eigenvectors represent the distribution of a term after propagation. The eigenvectors for all terms together form a (vector space) index that takes the structure of the data into account and can be used with standard document retrieval techniques. In extensive experiments including a user study on a real life wiki, we show how pest improves the quality of the ranking over a range of existing ranking approaches, yet achieves a query performance comparable to a plain vector space index.


► PEST is a novel approach to approximate querying of graph-structured data.
► It propagates term weights over the structure of related data items.
► It generalizes of PageRank with a term-weight dependent leap.
► It improves recall significantly with rankings preferred by users.
► It remains nearly as scalable as basic vector space search engines.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Systems - Volume 37, Issue 4, June 2012, Pages 372–390
نویسندگان
, , , , ,