کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
431320 688504 2012 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
On the number of elements to reorder when updating a suffix array
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
On the number of elements to reorder when updating a suffix array
چکیده انگلیسی

Recently new algorithms appeared for updating the Burrows–Wheeler Transform or the suffix array, when the text they index is modified. These algorithms proceed by reordering entries and the number of such reordered entries may be as high as the length of the text. However, in practice, these algorithms are faster for updating the Burrows–Wheeler Transform or the suffix array than the fastest reconstruction algorithms.In this article we focus on the number of elements to be reordered for real-life texts. We show that this number is related to LCP   values and that, on average, LaveLave entries are reordered, where LaveLave denotes the average LCP value, defined as the average length of the longest common prefix between two consecutive sorted suffixes. Since we know little about the LCP distribution for real-life texts, we conduct experiments on a corpus that consists of DNA sequences and natural language texts. The results show that apart from texts containing large repetitions, the average LCP value is close to the one expected on a random text.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Discrete Algorithms - Volume 11, February 2012, Pages 87–99
نویسندگان
, , ,