کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
433858 689640 2015 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Succinct indexes for reporting discriminating and generic words
ترجمه فارسی عنوان
شاخص های مختلط برای گزارش کلمات تبعیض آمیز و عمومی
کلمات کلیدی
شاخص های مختلط، رشته جستجو، محدوده پرس و جو
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی

We consider the problem of indexing a collection DD of D strings (documents) of total n characters from an alphabet set of size σ, such that whenever a pattern P (of p   characters) and an integer τ∈[1,D]τ∈[1,D] come as a query, we can efficiently report all (i) maximal generic words and (ii) minimal discriminating words as defined below:
• maximal generic word is a maximal extension of P occurring in at least τ documents.
• minimal discriminating word is a minimal extension of P occurring in at most τ documents.These problems were introduced by Kucherov et al. (SPIRE) [8], they proposed indexes occupying O(nlog⁡n)O(nlog⁡n) bits with query times O(p+output)O(p+output) and O(p+log⁡log⁡n+output)O(p+log⁡log⁡n+output) for Problem (i) and Problem (ii) respectively. The query time for Problem (ii) is later improved to optimal O(p+output)O(p+output) by Gawrychowski et al. (SPIRE) [6]. In this paper, we describe succinct indexes of nlog⁡σ+o(nlog⁡σ)+O(n)nlog⁡σ+o(nlog⁡σ)+O(n) bits space with near-optimal query times i.e., O(p+log⁡log⁡n+output)O(p+log⁡log⁡n+output) for both these problems.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 593, 16 August 2015, Pages 165–173
نویسندگان
, , , ,