K-means algorithms for functional data

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
409709	679086	2015	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Reproducing kernel Hilbert space - بازسازی هسته فضای هیلبرت Functional data - داده های عملکردی k-Means - میانگین ـ کی Dimensionality reduction - کاهش ابعاد، فروکاهی ابعاد

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

چکیده انگلیسی

Cluster analysis of functional data considers that the objects on which you want to perform a taxonomy are functions f:X⊂Rp↦Rf:X⊂Rp↦R and the available information about each object is a sample in a finite set of points fn={(xi,yi)∈X×R}i=1n. The aim is to infer the meaningful groups by working explicitly with its infinite-dimensional nature.In this paper the use of K-means algorithms to solve this problem is analysed. A comparative study of three K-means algorithms has been conducted. The K-means algorithm for raw data, a kernel K-means algorithm for raw data and a K -means algorithm using two distances for functional data are tested. These distances, called dVndVn and dϕdϕ, are based on projections onto Reproducing Kernel Hilbert Spaces (RKHS) and Tikhonov regularization theory. Although it is shown that both distances are equivalent, they lead to two different strategies to reduce the dimensionality of the data. In the case of dVndVn distance the most suitable strategy is Johnson–Lindenstrauss random projections. The dimensionality reduction for dϕdϕ is based on spectral methods.A key aspect that has been analysed is the effect of the sampling {xi}i=1n on the K-means algorithm performance. In the numerical study an ex professo example is given to show that if the sampling is not uniform in X, then a K-means algorithm that ignores the functional nature of the data can reduce its performance. It is numerically shown that the original K-means algorithm and that suggested here lead to similar performance in the examples when X is uniformly sampled, but the computational cost when working with the original set of observations is higher than the K -means algorithms based on dϕdϕ or dVndVn, as they use strategies to reduce the dimensionality of the data.The numerical tests are completed with a case study to analyse what kind of problem the K-means algorithm for functional data must face.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 151, Part 1, 3 March 2015, Pages 231–245

نویسندگان

María Luz López García, Ricardo García-Ródenas, Antonia González Gómez,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

K-means algorithms for functional data

دسترسی سریع

ارتباط

English Website