Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10360676 | Pattern Recognition | 2005 | 4 Pages |
Abstract
In this paper, the conventional k-modes-type algorithms for clustering categorical data are extended by representing the clusters of categorical data with k-populations instead of the hard-type centroids used in the conventional algorithms. Use of a population-based centroid representation makes it possible to preserve the uncertainty inherent in data sets as long as possible before actual decisions are made. The k-populations algorithm was found to give markedly better clustering results through various experiments.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Vision and Pattern Recognition
Authors
Dae-Won Kim, KiYoung Lee, Doheon Lee, Kwang H. Lee,