کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1147911 957808 2009 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
On the quality of k-means clustering based on grouped data
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات ریاضیات کاربردی
پیش نمایش صفحه اول مقاله
On the quality of k-means clustering based on grouped data
چکیده انگلیسی

Let us have a probability distribution P   (possibly empirical) on the real line RR. Consider the problem of finding the k-mean of P, i.e. a set A of at most k points that minimizes given loss-function. It is known that the k-mean can be found using an iterative algorithm by Lloyd [1982. Least squares quantization in PCM. IEEE Transactions on Information Theory 28, 129–136]. However, depending on the complexity of the distribution P, the application of this algorithm can be quite resource-consuming. One possibility to overcome the problem is to group the original data and calculate the k-mean on the basis of the grouped data. As a result, the new k-mean will be biased, and our aim is to measure the loss of the quality of approximation caused by such approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Statistical Planning and Inference - Volume 139, Issue 11, 1 November 2009, Pages 3836–3841
نویسندگان
, ,