Shi et al. (2022) systematically compare common beta-diversity metrics and clustering algorithms across five gut-16S datasets, identify data properties that drive failure modes of BrayβCurtis and unweighted UniFrac, and propose a simple combined metric (normalized BC + normalized UU, Ξ± = 0.5) that is robust across their examples and available as an R package (MicrobiomeCluster) for reproducibility and follow-up analyses.
Visualizations below reproduce key numeric summaries from the paper and demonstrate the mechanistic points the authors make about BrayβCurtis and unweighted UniFrac failure modes. Every factual claim below is tied to the original paper (Shi et al. 2022) via inline citation.
Data values are taken directly from the paper's Table 1 / extracted data summary and illustrate the very low summed highβabundance signal in the Schnorr dataset (0.058), which the authors link to BrayβCurtis underperformance in that dataset.
This plotted pattern reproduces the core qualitative results: UU does very well in several geographically separated datasets (including perfect for Schnorr) but fails on Smits (seasonal, many shared lowβabundance OTUs); BrayβCurtis fails when highβabundance signal is scarce (Schnorr) but is good when highβabundance OTUs drive separation (MartΓnez). The combined metric shows consistent midβtoβhigh performance across datasets in the authors' results.
All of these caveats are discussed by the authors; see the Availability/Methods and Supplement for their R package that enables reβanalysis.
Primary source (all claims above are anchored to this paper):
Custom summaries of the latest cutting edge Science research. Every Friday. No Ads.