Semi-supervised discriminative clustering with graph regularization

Article ID	Journal	Published Year	Pages	File Type
6861414	Knowledge-Based Systems	2018	35 Pages	PDF

Abstract

In this paper we propose a novel semi-supervised method, d-graph, which does not assume any predefined structure of clusters. We follow a discriminative approach and use logistic function to directly model posterior probabilities p(k|x) that point x belongs to kth cluster. Making use of these posterior probabilities we maximize the expected probability that pairwise constraints are preserved. To include unlabeled data in our clustering objective function, we introduce additional pairwise constraints so that nearby points are more likely to appear in the same cluster. The proposed model can be easily optimized with the use of gradient techniques and kernelized, which allows to discover arbitrary shapes and structures in data. The experimental results performed on various types of data demonstrate that d-graph obtains better clustering results than comparative state-of-the-art methods.

Keywords

Semi-supervised clustering Graph clustering Pairwise constraints Discriminative model