Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4944221 | Information Sciences | 2017 | 6 Pages |
Abstract
k-means is a preeminent partitional based clustering method that finds k clusters from the given dataset by computing distances from each point to k cluster centers iteratively. The filtering algorithm improves the performance of k-means by imposing an index structure on the dataset and reduces the number of cluster centers searched while finding the nearest center of a point. The performance of filtering algorithm is influenced by the degree of separation between initial cluster centers. In this paper, we propose an efficient initial seed selection method, RDBI, to improve the performance of k-means filtering method by locating the seed points at dense areas of the dataset and well separated. The dense areas are identified by representing the data points in a kd-tree. A comprehensive experimental analysis is performed to evaluate the performance efficiency of proposed method against state-of-the-art initialization methods and shown that the proposed method is efficient in terms of both running time and clustering accuracy.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
K. Mahesh Kumar, A. Rama Mohan Reddy,