کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
439261 690480 2008 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Algorithms for computing variants of the longest common subsequence problem
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Algorithms for computing variants of the longest common subsequence problem
چکیده انگلیسی

The longest common subsequence (LCS) problem is one of the classical and well-studied problems in computer science. The computation of the LCS is a frequent task in DNA sequence analysis, and has applications to genetics and molecular biology. In this paper we introduce new variants of LCS problem and present efficient algorithms to solve them. In particular we introduce the notion of gap constraints in the LCS problems. For the LCS problem with fixed gap, we first present a naive algorithm runs in O(n2+R(K+1)2) time, where R is the total number of ordered pairs of positions at which the two strings match and K is the fixed gap constraint. We then improve the running time to O(n2+RK+Rloglogn) using some novel techniques. Furthermore, we present an algorithm that is independent of K and runs in O(n2+Rloglogn) time. Using these techniques, we also present a new O(n2) algorithm to solve the original LCS problem. Additionally, we modify our algorithms to handle elastic and rigid gaps. We also apply the notion of rigidness to the original LCS problem and modify the traditional dynamic programming solution to handle the rigidness presenting a O(n2) algorithm to solve the problem. Finally, we also improve the solution to Rigid Fixed Gap LCS to O(n2). Notably, in all of the above cases, we assume that the two given strings are of equal length i.e. n. But our results can be easily extended to handle two strings of different length.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 395, Issues 2–3, 1 May 2008, Pages 255-267