Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10326476 | Neurocomputing | 2016 | 15 Pages |
Abstract
Clustering ensemble is an important part of ensemble learning. It aims to study and integrate multiple clustering results from different clustering algorithms or same algorithm with different initial parameters for the same dataset. CHAMELEON is a hierarchical clustering algorithm which can discover natural clusters of different shapes and sizes as the result of its merging decision dynamically adapts to the different clustering model characterized. Inspired by the idea of CHAMELEON, the paper proposes a novel clustering ensemble models including semi-supervised method and discusses its application in fault diagnosis of high speed train (HST) running gear. The contributions of this paper include: constructing a sparse graph via the similarity matrix which aggregates multiple clustering results; partitioning the sparse graph (vertex=object, edge weight=similarity) into a large number of relatively small sub-clusters; obtaining the final clustering partition by merging these sub-clusters repeatedly. The experimental results demonstrate that our method outperforms some of state-of-the-art ensemble algorithms regarding the accuracy and stability and recognizes fault patterns of HST running gear effectively.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Wenchao Xiao, Yan Yang, Hongjun Wang, Tianrui Li, Huanlai Xing,