کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
536500 870544 2011 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Improving DBSCAN’s execution time by using a pruning technique on bit vectors
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Improving DBSCAN’s execution time by using a pruning technique on bit vectors
چکیده انگلیسی

Clustering is the process of assigning a set of physical or abstract objects into previously unknown groups. The goal of clustering is to group similar objects into the same clusters and dissimilar objects into different clusters. Similarities between objects are evaluated by using the attribute values of objects. There are many clustering algorithms in the literature; among them, DBSCAN is a well known density-based clustering algorithm. We improve DBSCAN’s execution time performance for binary data sets and Hamming distances. We achieve considerable speed gains by using a novel pruning technique, as well as bit vectors, and binary operations. Our novel method effectively discards distant neighbors of an object and computes only the distances between an object and its possible neighbors. By discarding distant neighbors, we avoid unnecessary distance computations and use less CPU time when compared with the conventional DBSCAN algorithm. However, the accuracy of our method is identical to that of the original DBSCAN. Experimental test results on real and synthetic data sets demonstrate that, by using our pruning technique, we obtain considerably faster execution time results compared to DBSCAN.


► We propose a novel pruning technique that improves DBSCAN’s execution time performance.
► Pruning technique works only on binary data sets and for Hamming distance.
► The new technique speeds up the clustering process up to 40 times.
► It works best on sparse and dense data sets.
► Our technique works faster than R-tree and KD-tree.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 32, Issue 13, 1 October 2011, Pages 1572–1580
نویسندگان
, ,