Article ID Journal Published Year Pages File Type
428304 Information Processing Letters 2007 5 Pages PDF
Abstract

The longest common subsequence (LCS) problem can be used to measure the relationship between sequences. In general, the inputs of the LCS problem are two sequences. For finding the relationship between one sequence and a set of sequences, we cannot apply the traditional LCS algorithms immediately. In this paper, we define the mosaic LCS (MLCS) problem of finding a mosaic sequence C, composed of repeatable k sequences in source sequence set S, such that the LCS of C and the target sequence T is maximal. Based on the concept of break points in sequence T, we first propose a divide-and-conquer algorithm with O(n2m|S|+n3logk) time for solving this problem, where n and m are the length of T and the maximal length of sequences in S, respectively. Furthermore, an improved algorithm with O(n(m+k)|S|) time is proposed by applying an efficient preprocessing for the MLCS problem.

Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics