Text indexing with errors

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
430938	688234	2007	20 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Suffix array - آرایه سوفی Suffix tree - درخت سلفی Trie - سهمیه Levenshtein distance - فاصله Levenshtein Text indexing - نمایه سازی متن edit distance - ویرایش فاصله

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات

پیش نمایش صفحه اول مقاله

چکیده انگلیسی

In this paper we address the problem of constructing an index for a text document or a collection of documents to answer various questions about the occurrences of a pattern when allowing a constant number of errors. In particular, our index can be built to report all occurrences, all positions, or all documents where a pattern occurs in time linear in the size of the query string and the number of results. This improves over previous work where the look-up time was either not linear or depended upon the size of the document corpus. Our data structure has size O(nlogdn)O(nlogdn) on average and with high probability for input size n and queries with up to d errors. Additionally, we present a trade-off between query time and index complexity that achieves worst-case bounded index size and preprocessing time with linear look-up time on average.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Discrete Algorithms - Volume 5, Issue 4, December 2007, Pages 662–681

نویسندگان

Moritz G. Maaß, Johannes Nowak,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Text indexing with errors

دسترسی سریع

ارتباط

English Website