کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
426819 686300 2011 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Faster query algorithms for the text fingerprinting problem
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Faster query algorithms for the text fingerprinting problem
چکیده انگلیسی

Let S be a string over a finite, ordered alphabet Σ. For any substring S′ of S, the set of distinct characters contained in S′ is called its fingerprint. The text fingerprinting indexing problem is to construct a data structure for the string S in advance, so that on given any input subset C of Σ, we can answer the following queries efficiently: (1) determine if C represents a fingerprint of some substrings in S; (2) find all maximal substrings of S whose fingerprint is C. The best known results solved these two queries in Θ(|Σ|) and Θ(|Σ|+K) time, respectively, where K is the number of maximal substrings. In this paper, we propose two improved algorithms for the text fingerprinting indexing problem. The first one solves the two queries in and time, respectively. For the second one, the query time complexities are further reduced to O(|C|log(|Σ|/|C|)) and O(|C|log(|Σ|/|C|)+K). Both results answer an open problem proposed by Amir et al.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information and Computation - Volume 209, Issue 7, July 2011, Pages 1057-1069