کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
428146 686607 2007 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
On constructing an optimal consensus clustering from multiple clusterings
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
On constructing an optimal consensus clustering from multiple clusterings
چکیده انگلیسی

Computing a suitable measure of consensus among several clusterings on the same data is an important problem that arises in several areas such as computational biology and data mining. In this paper, we formalize a set-theoretic model for computing such a similarity measure. Roughly speaking, in this model we have k>1 partitions (clusters) of the same data set each containing the same number of sets and the goal is to align the sets in each partition to minimize a similarity measure. For k=2, a polynomial-time solution was proposed by Gusfield (Information Processing Letters 82 (2002) 159–164). In this paper, we show that the problem is MAX-SNP-hard for k=3 even if each partition in each cluster contains no more than 2 elements and provide a -approximation algorithm for the problem for any k.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing Letters - Volume 104, Issue 4, 15 November 2007, Pages 137-145