Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
438298 | Theoretical Computer Science | 2014 | 17 Pages |
Abstract
Recent advances in biotechnology and web technology are continuously generating huge collections of similar strings. People now face the problem of storing them compactly while supporting fast pattern searching. One compression scheme called relative Lempel–Ziv compression uses textual substitutions from a reference text to represent each string in S as a concatenation of substrings from a reference string R. This basic scheme gives a good compression ratio when every string in S is similar to R, but does not provide any pattern searching functionality. Here, we describe a new data structure based on relative Lempel–Ziv compression that is space-efficient and also supports fast pattern searching.
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics
Authors
Huy Hoang Do, Jesper Jansson, Kunihiko Sadakane, Wing-Kin Sung,