Article ID Journal Published Year Pages File Type
6938802 Pattern Recognition 2018 17 Pages PDF
Abstract
Clustering ensembles (CE) comprise a class of pattern recognition methods that take a set of data clusterings (base partitions) as input and generate a consensus, better-quality partition as output. This work tackles the CE problem from a hedonic game theoretical perspective. In the modeled cooperative game, data points are viewed as players while clusters are regarded as coalitions. Interestingly, we show that by using an evidence-accumulation based similarity measure our novel Hedonic Game based Clustering Ensemble (HGCE) algorithm always converges to a Nash stable coalition structure, that is, to a clustering solution that cannot be unilaterally improved from the standpoint of each data point. A variant of the algorithm is also introduced, which is insensitive to the way the data points are ordered in the data set. In order to assess the potentials of HGCE and contrast its performance with that exhibited by a number of CE methods, experiments have been conducted on several artificial and real-world data sets, the majority of which related to bioinformatics. Overall, the empirical results and statistical tests relative to two well-known external validity measures ratify the usefulness and competitiveness of the proposed approach, also showing that HGCE is computationally efficient and resilient to random perturbations to the set of base partitions.
Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, ,