/
IFCS'061A divisive approach for clustering symbolic dataM. ChaventMath IFCS'061A divisive approach for clustering symbolic dataM. ChaventMath

IFCS'061A divisive approach for clustering symbolic dataM. ChaventMath - PDF document

debby-jeon
debby-jeon . @debby-jeon
Follow
376 views
Uploaded On 2016-03-16

IFCS'061A divisive approach for clustering symbolic dataM. ChaventMath - PPT Presentation

IFCS0621 Divisive clustering methoddescendant hierarchical algorithmclassical or symbolic data2 Application for clustering a set of categoriesexample of a set of species contaminated with mercuryco ID: 258194

IFCS'0621. Divisive clustering methoddescendant hierarchical

Share:

Link:

Embed:

Download Presentation from below link

Download Pdf The PPT/PDF document "IFCS'061A divisive approach for clusteri..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

IFCS'061A divisive approach for clustering symbolic dataM. ChaventMathématiquesAppliquéesde Bordeaux (MAB), UniversitésBordeaux1 et 2 IFCS'0621. Divisive clustering methoddescendant hierarchical algorithmclassical or symbolic data2. Application for clustering a set of categoriesexample of a set of species contaminated with mercurycomparison of numerical and symbolic approach for clustering the species IFCS'064Introduction to di How to split a clusterComplete enumeration (Edward and Cavalli-Sforza, 1965)Heuristics (Mac-Naughton-Smith, 1964, ChidanandaGowdaet al. 1978)Criterion optimisation like split or diameter (Guénoche1991, Wang et al. 1996)Monothetic approachClustering : Williams & Lambert1959Classification :Breimanet al. 1984 (CART) or Quinlan 1986 (ID3) IFCS'066The proposed dissimilarity measure For a numerical symbolic(interval)data matrix :is the Hausdorffdistancebetween two intervalsIf all intervals are reduced to single points, itcorresponds to the Euclidean distanceThe normalized version of this distance : ),(1),('12'2jijipjHjiixxdvxxd== ),(1'12jijininiiHjxxdnv IFCS'068The extension of the within-clusterinertia thedistance matrixthe extension of the inertia of a cluster is :the extension of the within-clusters inertia of a partition in clusters is : 2''21)(iiCkkdnnCI)()(1==KkkCIPW IFCS'0610Binary question and symbolic data IFCS'0612The proposed divisive clustering method Choose at each step the cluster to be splitsuch that the new partition has minimum within inertia ?Stops after L iterations given as input by the user and the output is a hierarchyindexed by , which is isotonic )()()()(21kkkkCQCQCQCŠŠ= IFCS'0614First numerical treatment Clusteringof the 67 fish:Ward hierarchical clfish5 concentration variables Potamotrygonhystrix Platydorascostatus Semaprochilodusvarii85.7114.29 Pseudoancistrusbarbatus Leporinusfrederici66.6733.33 Leporinusfasciatus Dorasmicropoeus Hopliasaimara Cynodongibbus Ageneiosusbrevifili Cluster 4 Cluster3 IFCS'0616Second numerical treatment Clustering of the 10 species on a classical numerical data set…DorasmicropoeusAgeneiosusbrevifiliLn(Stomach/Muscle)Ln(liver/Muscle) IFCS'0619 …with the DIV method :Symbolic treatment