کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4959373 1445947 2017 43 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
New diagonal bundle method for clustering problems in large data sets
ترجمه فارسی عنوان
روش بسته بندی مورب جدید برای خوشه بندی مشکلات در مجموعه داده های بزرگ
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی
Clustering is one of the most important tasks in data mining. Recent developments in computer hardware allow us to store in random access memory (RAM) and repeatedly read data sets with hundreds of thousands and even millions of data points. This makes it possible to use conventional clustering algorithms in such data sets. However, these algorithms may need prohibitively large computational time and fail to produce accurate solutions. Therefore, it is important to develop clustering algorithms which are accurate and can provide real time clustering in large data sets. This paper introduces one of them. Using nonsmooth optimization formulation of the clustering problem the objective function is represented as a difference of two convex (DC) functions. Then a new diagonal bundle algorithm that explicitly uses this structure is designed and combined with an incremental approach to solve this problem. The method is evaluated using real world data sets with both large number of attributes and large number of data points. The proposed method is compared with two other clustering algorithms using numerical results.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: European Journal of Operational Research - Volume 263, Issue 2, 1 December 2017, Pages 367-379
نویسندگان
, , ,