Article ID Journal Published Year Pages File Type
406464 Neurocomputing 2014 14 Pages PDF
Abstract

•A novel algorithm is proposed for manifold clustering and representative selection.•A new method is described to find manifold centroid in Riemannian geometry.•A new random walk model measures the similarity of data points on nonlinear manifold.•An effective method is introduced to remove the bridging points between manifolds.

Many real-world applications expose the nonlinear manifold structure of the lower dimension rather than its high-dimensional input space. This greatly challenges most existing clustering and representative selection algorithms which do not take the manifold characteristics into consideration. The performance of the corresponding learning algorithms can be greatly improved if the manifold structure is considered. In this paper, we propose a graph-based k-means algorithm, GKM, which bears the simplicity of classic k-means while incorporating global information of data geometric distribution. GKM fully exploits the intrinsic manifold structure for appropriate data clustering and representative selection. GKM is evaluated on both synthetic and real-life data sets and achieves very impressive results compared to the state-of-the-art approaches, including classic k-means, kernel k-means, spectral clustering, and clustering through ranking and for representative selection. Given the widespread appearance of manifold structures in real world problems, GKM shows promising potential for partitioning manifold-distributed data.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,