کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
405247 | 677516 | 2012 | 11 صفحه PDF | دانلود رایگان |

In this paper an adaptive hierarchical fuzzy clustering algorithm is presented, named Hierarchical Data Divisive Soft Clustering (H2D-SC). The main novelty of the proposed algorithm is that it is a quality driven algorithm, since it dynamically evaluates a multi-dimensional quality measure of the clusters to drive the generation of the soft hierarchy. Specifically, it generates a hierarchy in which each node is split into a variable number of sub-nodes, determined by an innovative quality assessment of soft clusters, based on the evaluation of multiple dimensions such as the cluster’s cohesion, its cardinality, its mass, and its fuzziness, as well as the partition’s entropy. Clusters at the same hierarchical level share a minimum quality value: clusters in the lower levels of the hierarchy have a higher quality; this way more specific clusters (lower level clusters) have a higher quality than more general clusters (upper level clusters). Further, since the algorithm generates a soft partition, a document can belong to several sub-clusters with distinct membership degrees. The proposed algorithm is divisive, and it is based on a combination of a modified bisecting K-Means algorithm with a flat soft clustering algorithm used to partition each node. The paper describes the algorithm and its evaluation on two standard collections.
► An adaptive Hierarchical Data Divisive Soft Clustering algorithm is defined.
► It is a generalization of a modified bisecting K-Means with a Fuzzy C-Means.
► It dynamically generates clusters with a controlled multi-dimensional quality.
► Quality measures are cohesion, cardinality, fuzziness, specificity and entropy.
► An adaptive and variable number of hierarchy levels and clusters is generated.
Journal: Knowledge-Based Systems - Volume 26, February 2012, Pages 9–19