کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
437068 690071 2006 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Fast string matching by using probabilities: On an optimal mismatch variant of Horspool's algorithm
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Fast string matching by using probabilities: On an optimal mismatch variant of Horspool's algorithm
چکیده انگلیسی

The string matching problem, i.e. the task of finding all occurrences of one string as a substring of another one, is a fundamental problem in computer science. Recently, this problem received a great deal of attention due to numerous applications in computational biology. In this paper we address a modified version of Horspool's string matching algorithm using the probabilities of the different symbols to speed up the search. We show that the modified algorithm has a linear average running time; a precise asymptotical representation of the running time will be proven. A comparison of the average running time of the modified algorithm with well-known results for the original method shows that a substantial speed up for most of the symbol distributions has been achieved. Finally, we show that the distribution of the symbols can be approximated to a high precision using a random sample of sublinear size.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 359, Issues 1–3, 14 August 2006, Pages 329-343