کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
379417 | 659300 | 2007 | 17 صفحه PDF | دانلود رایگان |
In this paper, a novel robust deterministic annealing (RDA) algorithm is developed for data clustering. This method takes advantage of conventional noise clustering (NC) and deterministic annealing (DA) algorithms in terms of the independence of data initialization, the ability to avoid poor local optima, the better performance for unbalanced data, and the robustness against noise and outliers. In addition, a cluster validity criterion, i.e., Vapnik–Chervonenkis (VC)-bound induced index, which is estimated based on the structural risk minimization (SRM) principle, is specifically extended for RDA to determine the optimal cluster number for a given data set. The superiority of the proposed RDA clustering algorithm is supported by experimental results.
Journal: Data & Knowledge Engineering - Volume 62, Issue 1, July 2007, Pages 84–100