BGPT: Paper Review: No evidence that ACE2 or TMPRSS2 drive population disparity in COVID risks

Fuel Your Discoveries

Quick Explanation Copied

Core claim (skeptical check)

From the authors’ critique of the earlier ACE2/TMPRSS2 “population disparity” argument, the observed population differences are plausibly dominated by uneven genotype sampling (large samples reveal more rare variants), and the shortlisted ACE2/TMPRSS2 variants are largely too rare to typify most individuals—so they provide no strong empirical basis for ACE2/TMPRSS2 driving population-level COVID risk differences.

Evidence cited directly from the paper: BGPT paper-review context.

Primary critique source:

Long Explanation

Paper Review (Visual + Critical): No evidence that ACE2 or TMPRSS2 drive population disparity in COVID risks

Primary source:

1) What the paper is saying (tight mechanistic framing)

Mechanistic premise: ACE2 and TMPRSS2 are biologically relevant SARS-CoV-2 entry components, so population-level genetic differences could in principle matter.
Main objection: the earlier “population disparity” inference is largely an artifact of how public genotype datasets were sampled across populations and how rare variants were summarized.
Practical consequence: because the shortlisted ACE2/TMPRSS2 variants are typically very rare, most people carry none of them—so those “tallies” are unlikely to explain population-level risk differences.

These points are laid out throughout the paper’s argument about lopsided sample sizes, rare-variant discovery effects, and the mismatch between “variant counts” and frequency-weighted risk effects.

2) Visual: lopsided genotype sampling (how rare-variant counts get biased)

The paper highlights that some population groups in the pooled public genotype data were sampled much more heavily than others, which makes “rare variant found-at-all” tallies inflate large-sample groups even under no true underlying frequency difference.

3) Visual: why “variant tallies” can mislead when variants are rare

The paper argues that simply counting shortlisted rare variants found in each population’s sample is not equivalent to quantifying population-level risk, because detection probability and observed counts scale with sample size (discovery bias).

Paper’s implied statistical mismatch

Observed tally: “how many shortlisted variants appear in population sample data.”
Risk quantity needed: “expected number of carriers × expected effect size, integrated over genotype frequencies.”
Why it matters: for rare variants, “appears at least once” is far noisier and sample-size-dependent than “carrier frequency.”

4) Evidence the paper invokes against ACE2/TMPRSS2 as major population drivers

The paper cites a broader COVID host-genetics literature (as referenced within it) to argue that other loci show stronger, more reproducible associations with COVID-19 susceptibility/severity than ACE2/TMPRSS2 variants, and that only limited evidence exists among the earlier shortlisted set.

5) Mechanistic side-check (biology vs. population inference)

Even if ACE2/TMPRSS2 are plausible entry factors, the paper’s central point is about inference correctness: whether population-to-outcome differences can be attributed to ACE2/TMPRSS2 using the early genotype-prediction pipeline. A separate layer of biology comes from experimental work showing expression/function of ACE2/TMPRSS2 and their role in SARS-CoV-2 determinants—however, that does not automatically validate the early population-disparity inference.

6) Skeptical audit: key strengths and key weak spots

Strengths (what improves scientific reliability)

Targets a specific inference error: it directly argues that “rare variant presence” in unevenly sampled datasets does not equal “population-wide carrier frequency differences.”
Focus on frequency relevance: it emphasizes that what matters for risk is expected effects scaled by genotype frequencies, not predictive tallies of variants found at least once.

Weak spots / unknowns (what could still be missing)

Scope limitation: the critique centers on the shortlisted ACE2/TMPRSS2 variants from the earlier predictive pipeline; it may not fully rule out that other ACE2/TMPRSS2 variants (including regulatory variants not captured by the shortlist) could matter, though it argues heuristics and available evidence do not support net population disparity attributable to these genes.
Attribution difficulty: even if ACE2/TMPRSS2 variants modestly affect infection or severity in individuals, translating that into population differences requires correctly modeled carrier distributions and properly accounting for confounders—any remaining phenotype heterogeneity can mask small genetic contributions. The paper itself emphasizes the overwhelming importance of other factors for COVID risk variation.

7) What would disprove (or substantially revise) the paper’s conclusion?

Demonstrate that ACE2 and/or TMPRSS2 frequency-weighted genotype effects (not just rare-variant tallies) produce a statistically significant contribution to population-level COVID incidence/severity differences, after accounting for sampling biases and phenotype definition heterogeneity.
Provide robust evidence that shortlisted (or newly identified) ACE2/TMPRSS2 variants show clear replication in large, well-powered rare-variant association frameworks for relevant COVID phenotypes.

Author reviews (jump to BGPT deep-dive)

Feedback:

Updated: April 30, 2026