کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
384008 660838 2014 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Algorithm to determine ε-distance parameter in density based clustering
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Algorithm to determine ε-distance parameter in density based clustering
چکیده انگلیسی


• Proposed density based clustering approach adaptively determines ε-distance parameter.
• The methodology is based on the notion of k-nearest neighbours concept.
• Clustering quality of dimensions depend on the data distribution along that dimension.
• Dimensions having clustering quality less than threshold are pruned.
• This makes it appropriate for high dimensional data, as well varying density data.

The well known clustering algorithm DBSCAN is founded on the density notion of clustering. However, the use of global density parameter ε-distance makes DBSCAN not suitable in varying density datasets. Also, guessing the value for the same is not straightforward. In this paper, we generalise this algorithm in two ways. First, adaptively determine the key input parameter ε-distance, which makes DBSCAN independent of domain knowledge satisfying the unsupervised notion of clustering. Second, the approach of deriving ε-distance based on checking the data distribution of each dimension makes the approach suitable for subspace clustering, which detects clusters enclosed in various subspaces of high dimensional data. Experimental results illustrate that our approach can efficiently find out the clusters of varying sizes, shapes as well as varying densities.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 41, Issue 6, May 2014, Pages 2939–2946
نویسندگان
, ,