Article ID Journal Published Year Pages File Type
427196 Information Processing Letters 2013 4 Pages PDF
Abstract

•We define the Consensus problem.•We review results for the Euclidean metric and the Hamming string metric.•We show that the Consensus String problem is NP-hard for the swap metric.•We show that the Consensus String problem is APX-hard for the reversal metric.

Finding the consensus of a given set of strings is a hard and challenging problem. The problem is formally defined as follows: given a set of strings S={s1,…,sk} and a constant d, find, if it exists, a string s⁎ such that the distance of s⁎ from each of the strings does not exceed d.This problem has many applications. Two examples are: In biology, it may be used to seek a common ancestor to given sections of DNA. In web searching it may be used as a clustering aid.The stringology community researched this problem under the Hamming distance. In that metric the problem is NP-hard. A lot of work has been also done in the Euclidean metric.In this paper we consider the Consensus problem under other string metrics. We show that this problem is NP-hard for the swap metric and APX-hard for the reversal metric.

Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics