Why BGPT?
logo

Review papers with raw data transparency

Quickly verify claims by accessing the underlying experimental data and figures.







Press Enter ↵ to solve



    Fuel Your Discoveries




     Quick Explanation



    Concise verdict

    The TCGA Pan-Cancer analysis (Kandoth et al., Nature 2013) systematically catalogs somatic point mutations across 3,281 tumours (12 types), identifies 127 significantly mutated genes (SMGs), links mutation signatures to exposures/repair defects, and uses VAF/clonality to infer temporal order — a high-quality, foundational pan-cancer resource that is well‑cited and highly reusable but limited by exome-only data and variable calling pipelines




     Long Explanation



    Visual, critical review — Kandoth et al., "Mutational landscape and significance across 12 major cancer types" (Nature 2013)

    Key visual takeaways (from data)

    • Scale & scope: 3,281 tumours across 12 types; 617,354 somatic variants compiled (missense, silent, nonsense, splice, indels) — raw scope enables cross-tissue comparisons and discovery of 127 SMGs
    • Mutation burden & signatures: marked heterogeneity — AML lowest (~0.28 muts/Mb), LUSC highest (~8.15 muts/Mb); context shows smoking-related C>A enrichment in LUAD/LUSC, CpG transitions in GI/GBM/UCEC, and tissue-specific trinucleotide contexts
    • Driver gene landscape: 127 SMGs spanning canonical (TP53, PIK3CA, KRAS) and emerging processes (histone modifiers, splicing factors, proteolysis); TP53 most frequent (42% pan-cohort) and highly prevalent in OV (95%)

    Critical appraisal — strengths and weaknesses (evidence-cited)

    Major strengths
    • Large, multi-tissue sample set enabling comparative pan-cancer discovery and cross-type clustering
    • Robust statistical pipeline: MuSiC for SMG calling, Dendrix for exclusivity, and Cox models for survival associations — mature, published methods then-currently accepted in the field
    Key limitations & blindspots
    • Exome-only scope: structural variants, gene fusions and noncoding drivers are not covered; therefore the 127 SMGs are a lower-bound set of relevant driver events (paper acknowledges this)
    • Heterogeneous sample processing: exome-capture kits, sequencing platforms and variant callers differed across TCGA centres; authors attempted standardization but sensitivity/specificity differences remain — introduces possible tissue-specific detection biases and reduces direct comparability between tumour types (Methods text describes this thoroughly)
    • Clinical covariates and confounding: survival associations use Cox models with age/gender (and sometimes stage), but many tumour-specific prognostic confounders (treatment, molecular subtype, comorbidities) were not fully modelled — credible but cautious interpretation required (authors note limitations)

    Where the paper influenced the field (concrete examples)

    1. Framed a reproducible pan-cancer approach (shared Synapse resources, MuSiC pipelines) that later Pan-Cancer Atlas efforts and PCAWG built upon
    2. Highlighted non-classical categories (histone modifiers, splicing factors) as recurrently mutated across cancers — stimulated mechanistic and therapeutic follow-ups (e.g., epigenetic-targeting research)

    Reproducibility & data availability

    Data and provenance were made available on Synapse and the authors provide methods and supplementary tables; the main reproducibility caveat is heterogeneous upstream pipelines (capture/caller) which the authors could not fully normalize without reprocessing raw BAMs — nevertheless the paper is highly reproducible at the level of analysis because MuSiC/Dendrix/sciClone code and inputs (MAF/coverage) were documented

    Practical recommendations for researchers using these results

    • Use the 127 SMG list as a reliable exome-derived candidate set, but complement with copy-number, structural variant and transcriptome data when possible (the paper’s authors note this)
    • When doing cross-tissue comparisons, control for differences in mutation-calling sensitivity and capture targets (recalculate coverage thresholds per cohort if possible).

    Suggestions the authors or follow-up studies could/should implement

    • Re-run a single, harmonized variant-calling pipeline on raw BAMs (if access permitted) to remove capture/caller biases and re-assess SMG calls and exclusivity sets.
    • Integrate whole-genome structural variant calls and DNA methylation to detect non-coding drivers and regulatory alterations that can explain driver exclusivity clusters.
    • Deeper modeling of survival including treatment, subtype, and multi-gene interactions (time-varying coefficients) to refine prognostic claims.

    Minimal, falsifiable counterclaims

    If independent cohorts re-called uniformly from BAMs (or WGS) show that many of the 127 SMGs are artifacts of capture/calling differences or that structural drivers explain cluster associations better, that would overturn parts of the paper’s gene-centric conclusions; conversely, stability of the SMG set under unified re-calling would strongly reinforce the findings.

    Run deeper analyses — to evolve this review into a reproducible re-analysis (harmonized variant calling, SMG re-testing, mutational signature topography, clonality across all types), click the button below to start an AI bioinformatics agent that will run the necessary pipelines.

    Bottom-line assessment

    Kandoth et al., Nature 2013 is a rigorous, high‑impact pan-cancer exome study that produced a durable SMG catalogue, clarified tissue-specific mutation spectra and early-vs-late driver timing, and provided reproducible pipelines and datasets — but its exome-only scope and heterogeneous upstream pipelines mean users should integrate WGS and harmonized calling where possible before drawing therapeutic conclusions. All main claims above are supported directly by the paper’s methods/results and supplementary tables

    Included visual materials: bar chart of tumour counts by type; simple SMG distribution visualization. For deeper re-analysis (signatures, VAF clustering across all types, exclusivity network plots), run the AI Scientist agent above.

    Citation for primary paper used in this review:


    Feedback:   

    Updated: March 16, 2026

    BGPT Paper Review



    Study Novelty

    80%

    At publication (2013) this was among the first large, multi-tissue, harmonized TCGA pan-cancer exome analyses; identifying 127 SMGs and integrating mutation spectra, exclusivity and clonality across 12 types was novel and influential.



    Scientific Quality

    90%

    High technical quality: large sample size, transparent methods (MuSiC/Dendrix/sciClone), Synapse-tracked provenance, and careful caveats about heterogeneous calling; main red-flag is exome-only scope and variable upstream pipelines which authors acknowledged and partially mitigated.



    Study Generality

    80%

    Findings generalized across many adult epithelial and hematologic cancers (12 types) and exposed broadly relevant pathways (TP53, PI3K, chromatin modifiers), but pediatric or rare tumour types and non-coding drivers remain outside scope.



    Study Usefulness

    90%

    The SMG catalog and mutation-spectrum maps serve as foundations for subsequent mechanistic and translational work (biomarker discovery, pathway studies), widely reused by TCGA/PCAWG and others; immediate clinical applications limited by exome-only design but highly useful for research.



    Study Reproducibility

    80%

    Data and many intermediate files were released on Synapse and methods are documented (MuSiC/Dendrix/sciClone). Residual reproducibility caveat: heterogeneous capture and calling across sequencing centres required harmonization that authors could not fully perform without reprocessing raw BAMs.



    Explanatory Depth

    70%

    Provides mechanistic suggestions (pathway categories, clonality timing via VAF) and points to processes (histone modifiers, splicing) but does not deliver deep biochemical mechanisms — appropriate for a large-scale discovery paper but requires follow-up functional work.


    🎁 Authors: Collect 500 Free Science Tokens (≈ $50.0 USD)

    Claim My Author Tokens

    Use for 125 days of free BGPT access (4 tokens = 1 day) or trade/sell (≈ $50.0 USD)

     Top Data Sources ExportMCP



     Analysis Wizard



    Preparing a harmonized variant-call matrix from raw BAMs (re-map to GRCh37, joint calling, annotation), producing per-sample coverage masks and re-running MuSiC-style SMG tests to re-evaluate the 127-gene list across 12 tumour types.



     Hypothesis Graveyard



    A single universal minimal set of driver genes explains oncogenesis across cancers — falsified because tissue-specific TF/regulator mutations and structural/epigenetic alterations are required in many tissues.


    All prognostic effects of SMGs are independent of tumour subtype — weakened: many associations are confounded by subtype, treatment and other covariates and require stratified analyses.

     Science Art


    Paper Review: Mutational landscape and significance across 12 major cancer types Science Art

     Science Movie



    Make a narrated HD Science movie for this answer ($32 per minute)




     Discussion








    Get Ahead With Science Insights

    Custom summaries of the latest cutting edge Science research. Every Friday. No Ads.


    My BGPT