Article ID Journal Published Year Pages File Type
431141 Journal of Discrete Algorithms 2008 6 Pages PDF
Abstract

We study the problem of clustering fingerprints with at most p missing values   (CMV(p)CMV(p) for short) naturally arising in oligonucleotide fingerprinting  , which is an efficient method for characterizing DNA clone libraries. We show that already CMV(2)CMV(2) is NP-hard. We also show that a greedy algorithm yields a min(1+lnn,2+plnl)min(1+lnn,2+plnl) approximation for CMV(p)CMV(p), and can be implemented to run in O(nlp2)O(nl2p) time. We also introduce other variants of the problem of clustering incomplete fingerprints based on slightly different optimization criteria and show that they can be approximated in polynomial time with ratios 22p−122p−1 and 2(1−122p), respectively.

Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics
Authors
, , , , , ,