کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
435391 689902 2009 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Masking patterns in sequences: A new class of motif discovery with don’t cares
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Masking patterns in sequences: A new class of motif discovery with don’t cares
چکیده انگلیسی

We introduce a new notion of motifs, called masks, that succinctly represents the repeated patterns for an input sequence T of n symbols drawn from an alphabet Σ. We show how to build the set of all frequent maximal masks of length L in O(2Ln) time and space in the worst case, using the Karp–Miller–Rosenberg approach. We analytically show that our algorithm performs better than the method based on constant-time enumerating and checking all the potential (|Σ|+1)L candidate patterns in T, after a polynomial-time preprocessing of T. Our algorithm is also cache-friendly, attaining block transfers, where sort(n) is the cache complexity of sorting n items.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 410, Issue 43, 6 October 2009, Pages 4327-4340