Article ID Journal Published Year Pages File Type
396710 Information Systems 2014 16 Pages PDF
Abstract

•We propose a new top-n query, which ranks and retrieves the top-n points according to the cardinalities of the corresponding BRkNN answer sets.•To avoid computing the exact BRkNN answer set for each candidate point, we propose a mechanism to quickly compute the upper bounds of the cardinalities.•We propose an algorithm based on the concepts of Voronoi Diagram, bisectors, and triangle inequality for efficiently computing the BRkNN answer set for candidates.•We also solve an extended problem about coverage maximization, i.e., finding a set of n data points to maximize the union of their respective BRkNN answer sets.

A reverse k-nearest neighbor (RkNN) query retrieves the data points which regard the query point as one of their respective k nearest neighbors. A bi-chromatic reverse k-nearest neighbor (BRkNN) query is a variant of the RkNN query, considering two types of data. Given two types of data G and C, a BRkNN query regarding a data point q in G retrieves the data points from C that regard q as one of their respective k-nearest neighbors among the data points in G. Many existing approaches answer either the RkNN query or the BRkNN query. Different from these approaches, in this paper, we make the first attempt to propose a top-n query based on the concept of BRkNN queries, which ranks the data points in G and retrieves the top-n points according to the cardinalities of the corresponding BRkNN answer sets. For efficiently answering this top-n query, we construct the Voronoi Diagram of G to index the data points in G and C. From the information associated with the Voronoi Diagram of G, the upper bound of the cardinality of the BRkNN answer sets for each data point in G can be quickly computed. Moreover, based on an existing approach to answering the RkNN query and the characteristics of the Voronoi Diagram of G, we propose a method to find the candidate region regarding a BRkNN query, which tightens the corresponding search space. Finally, based on the triangle inequality, we propose an efficient refinement algorithm for finding the exact BRkNN answers from the candidate regions. To evaluate our approach on answering the top-n query, it is compared with an approach which applies a state-of-the-art algorithm for answering the BRkNN query to each data point in G. The experiment results reveal that our approach has a much better performance.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,