کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
484823 | 703295 | 2015 | 9 صفحه PDF | دانلود رایگان |

Data clustering has been widely applied to many practical applications, including social network analysis, location-based analysis, and scientific analysis. Considering the ongoing huge increases in the amount of data generated by Internet users and limited memory and computation resources, the design of scalable clustering has become very significant. In this paper, we propose a distributed clustering algorithm named HiClus, which is based on heterogeneous cloud computing. HiClus can efficiently build an adaptive distributed tree in the cloud to utilize computation resources of both central processing unit and general-purpose computing on graphics processing units. The evaluation herein shows that HiClus can scale up better, use less clustering time, and achieve better load balancing than existing MapReduce algorithms.
Journal: Procedia Computer Science - Volume 53, 2015, Pages 149-157