Banesh, DivyaAhrens, JamesBujack, RoxanaTominski, ChristianWaldner, ManuelaWang, Bei2024-05-172024-05-172024978-3-03868-251-6https://doi.org/10.2312/evs.20241070https://diglib.eg.org/handle/10.2312/evs20241070Hierarchical clustering arrange multi-dimensional data into a tree-like structure, organizing the data by increasing levels of similarity. A cut of the tree divides data into clusters, where cluster members share a likeness. Most common cutting techniques identify a single line, either by a metric or with user input, cutting horizontally through the tree, separating root from leaves. We present a new approach that algorithmically identifies cuts at multiple levels of the tree based on a metric we call robustness. We identify levels to maximize overall robustness by maximizing the height of the shortest branch of the hierarchical tree we must cut through. This technique minimizes the variation within clusters while maximizing the distance between clusters. We apply the same approach to merge trees from computational topology to find the most robust number of connected components. We apply the multi-level robust cut to two datasets to highlight the advantages compared to a traditional, single-level cut.Attribution 4.0 International LicenseCCS Concepts: Mathematics of computing → Algebraic topology; Information systems → Clustering and classificationMathematics of computing → Algebraic topologyInformation systems → Clustering and classificationRobust Cut for Hierarchical Clustering and Merge Trees10.2312/evs.202410705 pages