Article ID Journal Published Year Pages File Type
379421 Data & Knowledge Engineering 2007 21 Pages PDF
Abstract

Histogram techniques have been used in many commercial database management systems to estimate a query result size. Recently, it has been shown that they are very effective to support approximation of query processing especially aggregates. In this paper, we investigate the problem of minimizing average errors of approximate aggregates using histogram techniques. Firstly, we present a novel linear-spline histogram model that is more accurate than the existing models. Secondly, we propose a novel histogram construction technique for minimizing such average errors, which is shown to generate a near optimal histogram. Our experiment results demonstrate that the new histogram construction techniques lead to a great accuracy improvement on the existing techniques.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,