کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
435846 689944 2015 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Space-efficient representation of truncated suffix trees, with applications to Markov order estimation
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Space-efficient representation of truncated suffix trees, with applications to Markov order estimation
چکیده انگلیسی

Suffix trees (ST) are useful in many text processing applications, for example, to determine the number of occurrences of patterns of arbitrary length in an input string x. If the length n, of x  , is large, the memory required to represent the ST may become a practical performance bottleneck. This problem can be alleviated, in cases where a nontrivial upper bound is known on the lengths of the patterns of interest, by using a truncated ST (TST). However, conventional TST implementations still require Ω(n)Ω(n) bits of memory, since they store x. We describe a new TST representation that avoids this limitation by storing all the information necessary to reconstruct the TST edge labels in a string y that is often much shorter than x  . We apply TSTs to the implementation of Markov order estimators, where an upper bound knkn on the estimated order can be derived or it is imposed (for consistency, for example). The new representation allows for estimator implementations with sublinear space complexity in some cases of interest. In other cases we show, experimentally, that even when the new representation does not have an asymptotic advantage, it still achieves very significant memory savings in practice.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 595, 30 August 2015, Pages 34–45
نویسندگان
, , ,