Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
431141 | Journal of Discrete Algorithms | 2008 | 6 Pages |
Abstract
We study the problem of clustering fingerprints with at most p missing values (CMV(p)CMV(p) for short) naturally arising in oligonucleotide fingerprinting , which is an efficient method for characterizing DNA clone libraries. We show that already CMV(2)CMV(2) is NP-hard. We also show that a greedy algorithm yields a min(1+lnn,2+plnl)min(1+lnn,2+plnl) approximation for CMV(p)CMV(p), and can be implemented to run in O(nlp2)O(nl2p) time. We also introduce other variants of the problem of clustering incomplete fingerprints based on slightly different optimization criteria and show that they can be approximated in polynomial time with ratios 22p−122p−1 and 2(1−122p), respectively.
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics
Authors
A. Figueroa, A. Goldstein, T. Jiang, M. Kurowski, A. Lingas, M. Persson,