Article code: 397408
Journal code: 671192
Publication year: 2013
English article: 15 pages, PDF
Full-text version: free download
English title of the ISI article
Cost-aware query planning for similarity search
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

Similarity search aims to find all objects similar to a query object. Typically, some base similarity measures for the different properties of the objects are defined, and light-weight similarity indexes for these measures are built. A query plan specifies which similarity indexes to use with which similarity thresholds and how to combine the results. Previous work creates only a single, static query plan to be used by all queries. In contrast, our approach creates a new plan for each query. We introduce the novel problem of query planning for similarity search, i.e., selecting for each query the plan that maximizes completeness of the results with cost below a query-specific limit. By taking the frequencies of attribute values into account, we are able to better estimate plan completeness and cost, and thus to better distribute our similarity comparisons. Evaluation on a large real-world dataset shows that our approach significantly reduces cost variance and increases overall result completeness compared to static query plans.


► We introduce the problem of query planning for similarity search with query-specific cost limits.
► We define exact and approximate query planning algorithms.
► We evaluate on a large, real-world dataset.
► Our approach yields more complete results with a more reliable query runtime.
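
The planning problem stated in the abstract (pick, per query, the plan with the highest estimated completeness whose estimated cost stays below a limit) can be illustrated with a small sketch. This is not the paper's algorithm: the cost model (number of records sharing the query's attribute value), the completeness model (independent per-index recall), and all names such as `enumerate_plans`, `select_plan`, `freq`, and `recall` are assumptions made only for illustration.

```python
from itertools import combinations
from math import prod


def enumerate_plans(query, freq, recall):
    """Yield (attributes, est_cost, est_completeness) for every non-empty
    subset of the query's attributes.

    Toy assumptions (not taken from the paper):
      * cost of using an index = number of records sharing the query's value
      * completeness = 1 - product of (1 - recall) over the chosen indexes
    """
    attrs = sorted(query)
    for r in range(1, len(attrs) + 1):
        for subset in combinations(attrs, r):
            cost = sum(freq[a].get(query[a], 0) for a in subset)
            completeness = 1.0 - prod(1.0 - recall[a] for a in subset)
            yield subset, cost, completeness


def select_plan(query, freq, recall, cost_limit):
    """Exhaustively pick the feasible plan with the highest estimated
    completeness; ties go to the cheaper plan. Returns None if no plan
    fits the cost limit."""
    feasible = [p for p in enumerate_plans(query, freq, recall)
                if p[1] <= cost_limit]
    if not feasible:
        return None
    return max(feasible, key=lambda p: (p[2], -p[1]))


# Hypothetical value frequencies and per-index recall estimates.
freq = {
    "name": {"Smith": 900, "Zywicki": 3},
    "city": {"Berlin": 500, "Potsdam": 40},
}
recall = {"name": 0.8, "city": 0.5}

# A frequent name makes the name index too expensive: only "city" is chosen.
common = select_plan({"name": "Smith", "city": "Potsdam"}, freq, recall, 100)
# A rare name is cheap to look up: both indexes fit under the same limit.
rare = select_plan({"name": "Zywicki", "city": "Potsdam"}, freq, recall, 100)
print(common[0], rare[0])
```

Under these toy assumptions, the same cost limit leads to different plans for the two queries, which is the behavior a single static plan cannot provide. Exhaustive enumeration grows exponentially with the number of attributes, so it is only practical for a handful of indexes.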

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Systems - Volume 38, Issue 4, June 2013, Pages 455–469