Article ID: 412067
Journal: Neurocomputing
Published Year: 2015
Pages: 10 Pages
File Type: PDF
Abstract

In the first part of this paper, we present a method for building affinity matrices for spectral clustering from a density estimator that relies on K-means with a subbagging procedure. The approach is anchored in the theoretical work of Wong (1980, 1982a, b) [13], [14], [15] on the asymptotic properties of K-means as a density estimation method. The subbagging procedure is introduced to improve the accuracy of the density estimate. The behavior of the proposed method is analyzed on diverse data sets, and two new mechanisms are suggested to improve clustering results on non-convex data. In the second part of the paper, we establish a link between the presented method and the evidence accumulation clustering (EAC) approach by showing that a normalized version of the density-based similarity matrix is approximately equal to a normalized version of the co-association matrix. The co-association matrix gives, for each pair of data points, the probability that the two points are assigned to the same cluster over multiple K-means partitions. Experimental results on artificial and real data demonstrate the effectiveness of the method and provide empirical support for the established link.
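
To make the co-association matrix concrete, the following is a minimal sketch of how such a matrix could be estimated from repeated K-means runs on subsamples drawn without replacement (subbagging). It is not the authors' implementation and does not reproduce their density-based affinity or its normalization. The function names (`kmeans`, `assign`, `co_association`), the parameters (`k`, `n_runs`, `subsample`), and the choice to label every point by its nearest centroid after fitting on the subsample are all assumptions made for illustration.

```python
import numpy as np


def assign(X, centroids):
    """Index of the nearest centroid for every row of X."""
    d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def kmeans(X, k, rng, n_iter=20):
    """Plain Lloyd iterations (illustrative helper); returns k centroids."""
    centroids = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(X, centroids)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old centroid if a cell empties
                centroids[j] = members.mean(axis=0)
    return centroids


def co_association(X, k=10, n_runs=50, subsample=0.5, seed=0):
    """Fraction of runs in which each pair of points shares a K-means cell."""
    rng = np.random.default_rng(seed)
    n = len(X)
    m = max(k, int(subsample * n))  # subsample size (assumed parameterization)
    C = np.zeros((n, n))
    for _ in range(n_runs):
        idx = rng.choice(n, size=m, replace=False)  # subbagging: no replacement
        centroids = kmeans(X[idx], k, rng)
        labels = assign(X, centroids)  # label all points, not only the subsample
        C += labels[:, None] == labels[None, :]
    return C / n_runs


# Toy usage: two well-separated Gaussian blobs of 40 points each.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 0.3, (40, 2)), rng.normal(5, 0.3, (40, 2))])
C = co_association(X, k=4, n_runs=30)
```

Entry `C[i, j]` is the empirical frequency with which points `i` and `j` fall in the same cell, so the matrix is symmetric with a unit diagonal. A matrix of this kind could then be passed to a spectral clustering routine as a precomputed affinity.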

Related Topics
Physical Sciences and Engineering > Computer Science > Artificial Intelligence