Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6861414 | Knowledge-Based Systems | 2018 | 35 Pages |
Abstract
In this paper we propose a novel semi-supervised method, d-graph, which does not assume any predefined structure of clusters. We follow a discriminative approach and use logistic function to directly model posterior probabilities p(k|x) that point x belongs to kth cluster. Making use of these posterior probabilities we maximize the expected probability that pairwise constraints are preserved. To include unlabeled data in our clustering objective function, we introduce additional pairwise constraints so that nearby points are more likely to appear in the same cluster. The proposed model can be easily optimized with the use of gradient techniques and kernelized, which allows to discover arbitrary shapes and structures in data. The experimental results performed on various types of data demonstrate that d-graph obtains better clustering results than comparative state-of-the-art methods.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Marek Åmieja, Oleksandr Myronov, Jacek Tabor,