کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
437425 690139 2016 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Tight lower bound instances for k-means++ in two dimensions
ترجمه فارسی عنوان
موارد کران پایین تنگ برای k-means++ در دو بعد
کلمات کلیدی
k-means++ ؛ کران پایین
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی

The k-means++ seeding algorithm is one of the most popular algorithms that is used for finding the initial k centers when using the Lloyd's algorithm for the k-means problem. It was conjectured by Brunsch and Röglin [9] that k-means++ behaves well for datasets with small dimension. More specifically, they conjectured that the k  -means++ seeding algorithm gives O(log⁡d)O(log⁡d) approximation with high probability for any d-dimensional dataset. In this work, we refute this conjecture by giving two dimensional datasets on which the k  -means++ seeding algorithm achieves an O(log⁡k)O(log⁡k) approximation ratio with probability exponentially small in k. This solves open problems posed by Mahajan et al. [12] and by Brunsch and Röglin [9].

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 634, 27 June 2016, Pages 55–66
نویسندگان
, , ,