Claim SALAD is a family of sparse all-atom denoising models that generate designable and diverse protein backbones up to 1,000 residues with much lower runtime than prior diffusion models, and can be adapted via structure-editing to tasks unseen during training
The manuscript presents salad, a family of sparse all-atom denoising diffusion models for protein backbone generation. Key claims are: (a) substantially improved runtime vs prior diffusion models, (b) comparable or higher designability and diversity across protein lengths (50β1000 residues) when evaluated with standard in silico design pipelines (ProteinMPNN then ESMfold/AF2), and (c) extensibility to out-of-distribution tasks using structure-editing (motif scaffolding, shape conditioning, repeat and multi-state design)
Positive: code, Colab notebooks and parameter files are published on GitHub (repository described in README), and authors document hyperparameters, noise schedules, sampling steps, and evaluation pipelines (ProteinMPNN settings, ESMfold/AF2). This materially helps reproducibility. Remaining gaps: (a) full training logs and compute budgets are not exhaustively tabulated in the provided excerpts, (b) explicit random seeds and environment/container artifacts should be supplied for exact replication (authors mention apptainer/docker roadmap)
Each recommendation is feasible and would materially reduce uncertainty about biological validity and generality.
Summary judgement: salad presents a technical advance (sparse attention denoiser plus editing hooks) that meaningfully improves runtime and demonstrates in silico performance competitive with or exceeding recent diffusion models across a broad range of tasks and sizes . However, biological utility remains provisional until experimental folding/stability/function are demonstrated; dataset limitations constrain immediate applications to enzyme and small-molecule binder design.
What would overturn the core claims? If experimental tests showed low folding/stability despite favorable ESMfold/AF2 metrics, or retraining on AlphaFold DB removes designability advantages, the comparative claims would need re-evaluation.
You can run an iterative bioinformatics agent to reproduce and extend key analyses (generate backbones, run ProteinMPNN, refold with ESMfold/AF2, compute designability/diversity statistics, and bootstrap confidence intervals). Click below to start:
Custom summaries of the latest cutting edge Science research. Every Friday. No Ads.