کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
396660 670532 2016 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
The similarity-aware relational database set operators
ترجمه فارسی عنوان
اپراتورهای مجموعه پایگاه داده رابطه ای آگاه از شباهت
کلمات کلیدی
پردازش پرس و جوی شباهت؛ پایگاه داده های رابطه‌ای؛ اپراتورهای تنظیم
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Identifying similarities in large datasets is an essential operation in several applications such as bioinformatics, pattern recognition, and data integration. To make a relational database management system similarity-aware, the core relational operators have to be extended. While similarity-awareness has been introduced in database engines for relational operators such as joins and group-by, little has been achieved for relational set operators, namely Intersection, Difference, and Union. In this paper, we propose to extend the semantics of relational set operators to take into account the similarity of values. We develop efficient query processing algorithms for evaluating them, and implement these operators inside an open-source database system, namely PostgreSQL. By extending several queries from the TPC-H benchmark to include predicates that involve similarity-based set operators, we perform extensive experiments that demonstrate up to three orders of magnitude speedup in performance over equivalent queries that only employ regular operators.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Systems - Volume 59, July 2016, Pages 79–93
نویسندگان
, , , , ,