Article code | Journal code | Publication year | English article | Full-text version
535849 | 870392 | 2012 | 5-page PDF | Free download
English title of the ISI article
Density-based hierarchical clustering for streaming data
Related topics
Engineering and Basic Sciences > Computer Engineering > Computer Vision and Pattern Recognition
Abstract

For streaming data that arrive continuously, such as multimedia data and financial transactions, clustering algorithms are typically allowed to scan the data set only once. Existing research in this domain mainly focuses on improving the accuracy of clustering. In this paper, a novel density-based hierarchical clustering scheme for streaming data is proposed in order to improve both accuracy and effectiveness; it is based on the agglomerative clustering framework. Traditionally, clustering algorithms for streaming data often use the cluster center to represent the whole cluster when conducting cluster merging, which may lead to unsatisfactory results. We argue that even if the data set is accessed only once, some parameters, such as the variance within a cluster, the intra-cluster density and the inter-cluster distance, can be calculated accurately. This may bring measurable benefits to the process of cluster merging. Furthermore, we employ a general framework that can incorporate different criteria and, given the same criteria, will produce similar clustering results for both streaming and non-streaming data. In experimental studies, the proposed method demonstrates promising results with reduced time and space complexity.


► A density-based hierarchical clustering scheme is proposed for streaming data.
► The scheme is based on the agglomerative clustering framework.
► Even if the data set is accessed only once, some parameters can be calculated accurately.
► A new merging criterion incorporating the above factors is also proposed.
► The proposed method demonstrates promising results with reduced time and space complexity.
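
The claim that some cluster parameters can be calculated accurately in a single pass can be illustrated with additive summary statistics. The sketch below is not the paper's method, which the abstract does not specify. It assumes a summary made of a point count, a per-dimension linear sum and a scalar sum of squared norms (similar in spirit to BIRCH clustering features). The names `ClusterSummary` and `centroid_distance` are hypothetical. Intra-cluster density is left out because the abstract does not define it.

```python
import math


class ClusterSummary:
    """One-pass, mergeable summary of a cluster (hypothetical helper)."""

    def __init__(self, dim):
        self.n = 0                      # number of points seen
        self.linear_sum = [0.0] * dim   # per-dimension sum of coordinates
        self.square_sum = 0.0           # sum of squared norms of the points

    def add(self, point):
        """Absorb one streamed point; the point itself is not stored."""
        self.n += 1
        for i, x in enumerate(point):
            self.linear_sum[i] += x
        self.square_sum += sum(x * x for x in point)

    def centroid(self):
        return [s / self.n for s in self.linear_sum]

    def variance(self):
        """Mean squared distance of the points to the centroid."""
        c = self.centroid()
        v = self.square_sum / self.n - sum(x * x for x in c)
        return max(0.0, v)  # guard against tiny negative rounding error

    def merge(self, other):
        """Summary of the union of two clusters, without revisiting data."""
        merged = ClusterSummary(len(self.linear_sum))
        merged.n = self.n + other.n
        merged.linear_sum = [a + b for a, b in
                             zip(self.linear_sum, other.linear_sum)]
        merged.square_sum = self.square_sum + other.square_sum
        return merged


def centroid_distance(a, b):
    """Euclidean distance between two cluster centroids."""
    return math.dist(a.centroid(), b.centroid())
```

Because all three quantities are plain sums, merging two summaries gives the same variance as recomputing it over the raw points of both clusters (up to floating-point rounding), so a merging criterion can use more than the cluster center without a second scan of the data.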

Publisher
Database: Elsevier - ScienceDirect
Journal: Pattern Recognition Letters - Volume 33, Issue 5, 1 April 2012, Pages 641–645