Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4944867 | Information Sciences | 2017 | 21 Pages |
Abstract
We prove the LCSK query is NP-hard, and devise exact algorithm as well as approximate algorithm with provable approximation bound to this problem. The proposed exact algorithm, namely MergeList, explores the candidate space progressively with several pruning strategies, which is based on the keyword hash table index structure. Unfortunately, this approach is not scalable to large datasets. We thus develop an approximate algorithm called MaxMargin. It finds the answer by traversing the proposed LIR-tree in the best-first fashion. Moreover, two optimizing strategies are used to improve the query performance. The experiments on real and synthetic datasets verify that the proposed approximate algorithm runs much faster than the competitor with desired accuracy.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Pengfei Zhang, Huaizhong Lin, Bin Yao, Dongming Lu,