BGPT: Paper Review: DeepRES: Deep learning enables reaction-based comprehensive enzyme screening

Explore by Goal

Quick Explanation Copied

DeepRES is a two-stage, multimodal DL framework that first predicts whether a protein is an enzyme (vs non-enzyme) and then retrieves which enzymatic reactions it can catalyze—designed to work directly for proteins of unknown function and to avoid rigid dependence on EC classification schemas.

Major claimed successes include mapping 897 orphan enzymes (of 1,255 with complete reaction encoding) to candidate proteins in the ESM Metagenomic Atlas and then using those candidates to mine biosynthetic gene clusters (BGCs), including anthocyanin-degradation–related modules (EPM0520, EPM0522).

Long Explanation

Paper Review (Visual, Critical): DeepRES

Framework for reaction-agnostic comprehensive enzyme screening from protein sequences & predicted structures to Reaction SMILES.

Preprint 2025-07 (DeepRES DOI)

Citation:

1) What DeepRES does (visual first)

Pipeline flow (as described in the paper)

Input: proteins (sequence + structure-aware 3Di) and reactions (Reaction SMILES).
Step A (EnzymeCNN): predicts enzyme vs non-enzyme.
Step B (EnzymeCLIP): embeds proteins and reactions into a shared space and retrieves protein–reaction pairs via cosine similarity.
Screening output: for proteins predicted as enzymes, top-retrieved catalytic reactions (and thus candidate orphan enzyme mappings for reactions lacking gene annotations).

2) Dataset scale & mapping funnel (metagenome application)

Key numerical claims used in the funnel: 13,902,058 proteins of unknown function → 595,340 predicted enzymes → 97,074 proteins associated with orphan enzymes → 897 orphan enzymes mapped from 1,255 orphan enzymes with complete reaction SMILES.

3) Example BGC mining outputs: anthocyanin-degradation modules

Claim context: the paper reports that BGC screening for EPM0520 and EPM0522 identified novel BGC candidates from 50 and 98 microbial genomes, respectively.

4) How well does DeepRES address the stated gap? (skeptical but fair)

What problem it targets (known/claimed)

The paper frames the “orphan enzyme” problem as a sequence-to-reaction annotation gap: reactions exist in pathway/reaction resources but lack direct gene-sequence associations, and protein databases include many unknown-function proteins that might include orphan enzymes. DeepRES is positioned as a tool to associate proteins of unknown function with reactions (including orphan enzymes) without requiring a fixed reaction classification like EC numbers.

What’s scientifically plausible vs what’s not yet proven

Plausible & supported by the design: multimodal embeddings can enable cross-modal retrieval if training data coverage is sufficient; the two-step gating (enzyme vs non-enzyme) is also a coherent way to reduce reaction retrieval clutter.
Still uncertain: whether the top retrieved reactions correspond to true biochemical catalysis for specific candidate proteins in vivo/in vitro is not directly demonstrated in the provided text (no wet-lab validation is reported in the excerpt). So mapping counts and downstream BGC candidate discovery are evidence of computational plausibility, not biochemical truth.
Dataset-coverage dependence: EnzymeCLIP is anchored to reaction information availability; if reaction encoding is incomplete (or reaction SMILES representation is imperfect), candidate retrieval may be systematically biased toward well-represented reaction subsets and miss reactions with poor encoding. The paper explicitly notes a limitation: reaction encoding based on RXNFP can have difficulty with chirality, and they report that ~40% of predicted enzymes were not linked to any known enzymatic reactions.

5) Evidence assessment (only using what the paper text provides)

Model evaluation style

EnzymeCNN: optimized on Swiss-Prot + AlphaFold DB representations; ablated sequence-only vs structure-only (EnzymeCNN-AA vs EnzymeCNN-3Di); compared against MMseqs2 and Foldseek under “similarity-limited” test settings and on a new protein dataset.
EnzymeCLIP: tuned architecture/training settings using Swiss-Prot/AlphaFoldDB/Rhea; compared against CLEAN for EC prediction using top-k accuracy; screened metagenomic proteins via EnzymeCNN gating and top-10% threshold for positive selection.

6) Critical blind spots & falsification targets

Main blind spots (what could mislead)

Many-to-many biology: enzyme multifunctionality and convergent evolution produce many-to-many protein–reaction links; CLIP-like training often assumes a strong one-to-one correspondence within batches, which the authors themselves discuss as a limitation (and they test soft contrastive loss variants that reduced performance).
Reaction encoding fidelity (chirality): stereoselectivity is central in enzymes; if Reaction SMILES encoding struggles with chirality recognition, retrieval can be systematically off-target for stereospecific reactions.
“Unlinked predicted enzymes” is not necessarily false: the reported ~40% unlink rate could reflect genuine unknown enzymatic activities not present in the reaction dataset (or incomplete reaction representations), but it also could reflect model errors. Without external biochemical validation, you cannot disentangle these explanations.
Downstream BGC inference remains correlative: candidate BGC enrichment based on genomic neighborhood/homology is hypothesis-generating, not proof of pathway functionality. The paper does not provide wet-lab pathway reconstruction details in the excerpt.

What would most disprove the approach (high leverage tests)

The following falsification targets are scientifically targeted to the core claims:

Protein–reaction retrieval validity: identify a set of top-ranked orphan enzyme candidates for specific reactions and show low experimental catalytic concordance relative to non-candidates.
Reaction encoding/chirality validity: focus on stereospecific reactions where the paper predicts potential chirality encoding limitations; show systematic failure specifically on those stereoisomers if encoding is the limiting factor.
BGC-to-metabolite plausibility: for BGC candidates in EPM0520/EPM0522-linked genomes, show absence of predicted pathway products despite gene co-localization—or conversely, show pathway products where DeepRES retrieval does not nominate candidates.

The paper’s own stated limitations around CLIP assumptions and chirality/reactant encoding motivate the above tests.

7) Practical utility: where DeepRES can be immediately useful

Actionable benefits (from the excerpt)

Hypothesis generation: provides a candidate protein set for orphan reactions, potentially supporting downstream homology mapping into microbial genomes for pathway-level discovery.
Pipeline integration claim: the paper claims orphan enzyme candidates can serve as a reference database for homology search and can be integrated into current genomic annotation pipelines (described as beneficial because it does not require GPU for homology search).

8) Reproducibility & transparency

What is available

The paper states code is available on GitHub and model weights/data and application results are accessible via Zenodo (with a DOI).

Conflict of interest awareness (skeptical, non-ideological)

The excerpt reports that corresponding author Takuji Yamada is a founder of multiple companies, with a statement that these companies had no control over interpretation/writing/publication. This does not invalidate computational work, but it does motivate stronger scrutiny of benchmarking choices, thresholding, and claims of superiority.

Take-home (one-paragraph, critical but fair)

DeepRES is a coherent, reaction-centric computational framework that combines enzyme gating with cross-modal protein–reaction retrieval using structure-aware protein representations and Reaction SMILES encodings, and it claims strong performance and meaningful orphan-enzyme mapping at large metagenomic scale, followed by BGC hypothesis generation for gut-associated modules (e.g., anthocyanin degradation). However, the approach’s ultimate biochemical validity for specific orphan candidates remains unproven in the excerpt, and the paper itself highlights important uncertainty sources: many-to-many protein–reaction relationships challenging CLIP assumptions, potential chirality/encoding limitations in Reaction SMILES NLP, and a substantial fraction of predicted enzymes that are not linked to known reactions—whose explanation could be either model error or genuinely unknown enzymatic activity.

Author reviews (bespoke buttons)

Feedback:

Updated: April 20, 2026