Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
434636 | Theoretical Computer Science | 2013 | 16 Pages |
Abstract
This paper proposes a class of string kernels that can handle a variety of subsequence-based features. Slight adaptations of the basic algorithm allow for weighing subsequence lengths, restricting or soft-penalizing gap-size, character-weighing and soft-matching of characters. An easy extension of the kernels allows for comparing run-length encoded strings with a time-complexity that is independent of the length of the original strings. Such kernels have applications in image processing, computational biology, in demography and in comparing partial rankings.
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics