کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4395227 1618392 2016 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Clustering in non-parametric multivariate analyses
ترجمه فارسی عنوان
خوشه بندی در تجزیه و تحلیل چند متغیره غیر پارامتری
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم آبزیان
چکیده انگلیسی


• Dendrograms may be poor representations of inter-sample dissimilarities.
• ANOSIM R and SIMPROF are combined to generate new methods of clustering.
• UNCTREE is a binary divisive clustering algorithm.
• k-R clustering is a flat clustering algorithm.
• Robustness of clustering is assessed by applying different methods to example data.

Non-parametric multivariate analyses of complex ecological datasets are widely used. Following appropriate pre-treatment of the data inter-sample resemblances are calculated using appropriate measures. Ordination and clustering derived from these resemblances are used to visualise relationships among samples (or variables). Hierarchical agglomerative clustering with group-average (UPGMA) linkage is often the clustering method chosen. Using an example dataset of zooplankton densities from the Bristol Channel and Severn Estuary, UK, a range of existing and new clustering methods are applied and the results compared. Although the examples focus on analysis of samples, the methods may also be applied to species analysis. Dendrograms derived by hierarchical clustering are compared using cophenetic correlations, which are also used to determine optimum β in flexible beta clustering. A plot of cophenetic correlation against original dissimilarities reveals that a tree may be a poor representation of the full multivariate information. UNCTREE is an unconstrained binary divisive clustering algorithm in which values of the ANOSIM R statistic are used to determine (binary) splits in the data, to form a dendrogram. A form of flat clustering, k-R clustering, uses a combination of ANOSIM R and Similarity Profiles (SIMPROF) analyses to determine the optimum value of k, the number of groups into which samples should be clustered, and the sample membership of the groups. Robust outcomes from the application of such a range of differing techniques to the same resemblance matrix, as here, result in greater confidence in the validity of a clustering approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Experimental Marine Biology and Ecology - Volume 483, October 2016, Pages 147–155
نویسندگان
, , ,