Article code: 396719
Journal code: 670558
Publication year: 2014
English article: 28-page PDF
Full-text version: free download
English title of the ISI article
Integrating domain heterogeneous data sources using decomposition aggregation queries
Persian translation of the title
ادغام منابع داده ناهمگن دامنه با استفاده از نمادهای تجمعی تجزیه
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract


• We introduce the decomposition aggregation query (DAQ) to handle domain heterogeneity.
• We develop query rewriting algorithms for DAQs.
• We process DAQs in two phases and optimize the query processing.

This paper introduces the decomposition aggregation query (DAQ), which extends semantic integration queries by allowing query translation to create aggregate queries based on the DAQ's novel three-role structure. We describe how DAQs are applied to integrate domain heterogeneous data sources, the new semantics of DAQ answers, and the query translation algorithm, called “aggregation rewriting”. A central problem in optimizing DAQ processing is determining the data sources to which the DAQ is translated. Our source selection algorithm has cover-finding and partitioning steps, which are optimized to (1) lower the processing overhead while speeding up query answering and (2) eliminate duplicates with minimal overhead. We establish connections between the source selection optimizations and classic NP-hard optimization problems, and solve them with efficient solvers. We empirically study both the DAQ query translation and the source selection algorithms using real-world and synthetic data sets. The results show that the query translation algorithms scale satisfactorily in both the size of aggregations and data sources, and that the source selection algorithms save a considerable amount of computational resources.
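The abstract does not say which classic NP-hard problems the source selection steps are connected to, nor how the paper's solvers work. As a rough illustration only, the cover-finding step can be read as a set cover instance and the partitioning step as assigning each required component to exactly one selected source so that nothing is aggregated twice. The sketch below uses the standard greedy set cover heuristic under that assumption; the function names, the source names `S1` to `S4`, and the component labels are all hypothetical and are not taken from the paper.

```python
def greedy_cover(required, sources):
    """Select sources until every required component is covered.

    Standard greedy set cover heuristic (an assumption for illustration,
    not the paper's algorithm). `sources` maps a source name to the set
    of components that source can provide.
    """
    uncovered = set(required)
    chosen = []
    while uncovered:
        # Take the source that covers the most still-uncovered components;
        # iterating in sorted order makes ties resolve alphabetically.
        best = max(sorted(sources), key=lambda name: len(sources[name] & uncovered))
        gain = sources[best] & uncovered
        if not gain:
            raise ValueError(f"no source covers: {sorted(uncovered)}")
        chosen.append(best)
        uncovered -= gain
    return chosen


def partition(required, sources, chosen):
    """Assign each required component to exactly one chosen source.

    Components offered by several chosen sources go to the earliest one,
    so an aggregate over the assignment counts each component once.
    """
    remaining = set(required)
    assignment = {}
    for name in chosen:
        assignment[name] = sources[name] & remaining
        remaining -= assignment[name]
    return assignment


# Hypothetical example: five components spread over four overlapping sources.
required = {"a", "b", "c", "d", "e"}
sources = {
    "S1": {"a", "b", "c"},
    "S2": {"c", "d"},
    "S3": {"d", "e"},
    "S4": {"a"},
}
chosen = greedy_cover(required, sources)
parts = partition(required, sources, chosen)
```

In this toy instance the greedy pass first selects `S1` (three new components) and then `S3` (the remaining two), so `S2` and `S4` are never queried. The greedy heuristic gives an approximate cover rather than a guaranteed minimum one, which is the usual trade-off when the underlying problem is NP-hard.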

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Systems - Volume 39, January 2014, Pages 80–107