کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
435987 689959 2015 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Compressed automata for dictionary matching
ترجمه فارسی عنوان
اتوماتای ​​فشرده برای تطابق فرهنگ لغت
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی

We address a variant of the dictionary matching problem where the dictionary is represented by a straight line program (SLP). For a given SLP-compressed dictionary DD of size n and height h representing m patterns of total length N  , we present an O(n2log⁡N)O(n2log⁡N)-size representation of Aho–Corasick automaton which recognizes all occurrences of the patterns in DD in amortized O(h+m)O(h+m) running time per character. We also propose an algorithm to construct this compressed Aho–Corasick automaton in O(n3log⁡nlog⁡N)O(n3log⁡nlog⁡N) time and O(n2log⁡N)O(n2log⁡N) space. In a spacial case where DD represents only a single pattern, we present an O(nlog⁡N)O(nlog⁡N)-size representation of the Morris–Pratt automaton which permits us to find all occurrences of the pattern in amortized O(h)O(h) running time per character, and we show how to construct this representation in O(n3log⁡nlog⁡N)O(n3log⁡nlog⁡N) time with O(n2log⁡N)O(n2log⁡N) working space.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 578, 3 May 2015, Pages 30–41
نویسندگان
, , , , ,