Robust Cut for Hierarchical Clustering and Merge Trees

dc.contributor.authorBanesh, Divyaen_US
dc.contributor.authorAhrens, Jamesen_US
dc.contributor.authorBujack, Roxanaen_US
dc.contributor.editorTominski, Christianen_US
dc.contributor.editorWaldner, Manuelaen_US
dc.contributor.editorWang, Beien_US
dc.date.accessioned2024-05-17T18:48:10Z
dc.date.available2024-05-17T18:48:10Z
dc.date.issued2024
dc.description.abstractHierarchical clustering arrange multi-dimensional data into a tree-like structure, organizing the data by increasing levels of similarity. A cut of the tree divides data into clusters, where cluster members share a likeness. Most common cutting techniques identify a single line, either by a metric or with user input, cutting horizontally through the tree, separating root from leaves. We present a new approach that algorithmically identifies cuts at multiple levels of the tree based on a metric we call robustness. We identify levels to maximize overall robustness by maximizing the height of the shortest branch of the hierarchical tree we must cut through. This technique minimizes the variation within clusters while maximizing the distance between clusters. We apply the same approach to merge trees from computational topology to find the most robust number of connected components. We apply the multi-level robust cut to two datasets to highlight the advantages compared to a traditional, single-level cut.en_US
dc.description.sectionheadersMerge Trees, Uncertainty, and Studies
dc.description.seriesinformationEuroVis 2024 - Short Papers
dc.identifier.doi10.2312/evs.20241070
dc.identifier.isbn978-3-03868-251-6
dc.identifier.pages5 pages
dc.identifier.urihttps://doi.org/10.2312/evs.20241070
dc.identifier.urihttps://diglib.eg.org/handle/10.2312/evs20241070
dc.publisherThe Eurographics Associationen_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCCS Concepts: Mathematics of computing → Algebraic topology; Information systems → Clustering and classification
dc.subjectMathematics of computing → Algebraic topology
dc.subjectInformation systems → Clustering and classification
dc.titleRobust Cut for Hierarchical Clustering and Merge Treesen_US
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
14_evs20241070.pdf
Size:
791.56 KB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
1036-i10.mp4
Size:
8.96 MB
Format:
Video MP4
Collections