Article ID Journal Published Year Pages File Type
437080 Theoretical Computer Science 2012 11 Pages PDF
Abstract

We present the variable length local decoding, a method which augments the alphabet of a sequence or a set of sequences. Roughly speaking, the approach distinguishes several types of symbols/nucleotides according to their contexts in the sequences. These contexts have variable lengths and are defined from a prefix code.We first give an original algorithm computing the decoding with a complexity linear both in time and memory space. Next, the approach is applied to alignment-free sequence comparison. We give a heuristic way to select context lengths relevant to this question. The comparison of sequences itself is based on the composition in “augmented” symbols of their variable length local decodings. The results of this comparison are illustrated on a biological alignment.

Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics