SKILL.md
Medical Imaging AI Literature Review Skill (v3.0.0)
Produce comprehensive reviews that pass first-round peer review on factual grounds, not just structural grounds.
This is not a template-filling skill. It is a write-with-verify discipline.
Quick Start
A review project lives in 4 files (3 you write, 1 the skill provides):
project_root/
├── PARADIGM.md # Style spec from 2-3 exemplar reviews (Phase 0)
├── CLAUDE.md # Project-specific terminology + literature inventory
├── IMPLEMENTATION_PLAN.md # 3-axis outline + per-claim verification checklist
└── manuscript_draft.md # The actual manuscript
Follow the 6-phase workflow in references/WORKFLOW.md. The phases are: paradigm capture → init → collect-and-verify → 3-axis outline → write-with-per-claim-verification → multi-agent peer review.
Core Principles
Writing voice — match strength to evidence, not hedge by default
Calibrate language to evidence strength, not to a fixed hedging register.
When ≥2 independent peer-reviewed groups confirm a finding, state it strongly. When evidence is single-source or contested, state it cautiously. When evidence is absent, say so.
Avoid the LLM tells:
- "has shown promising results"
- "may suggest"
- "interestingly,"
- "it is worth noting that"
- "in recent years,"
- "demonstrates the effectiveness of"
- "may offer significant advantages"
These phrases are AI-detector top features. Real flagship-review authors don't use them. Strip them.
Take a position when evidence supports it. Neutral catalogue is the LLM default and the failure mode to avoid. See [Verdict sentences](#verdict-sentences) below.
Citations — every claim verified before commit
Every [N] citation must satisfy four checks:
- The cited paper exists (DOI / PMID resolves on PubMed or Crossref).
- The author list matches the first-source (especially first and last author).
- The numeric claim in the body sentence (Dice, HR, sample size, etc.) appears in the cited paper's abstract or results section.
- The directional claim in the body sentence (higher/lower, increased/decreased) matches the source's stated direction.
If any check fails, the citation is broken — fix before continuing. See references/CITATION_INTEGRITY.md for the full protocol.
Method descriptions — read first, write after
Do not fill in a template like [Author] et al. [ref] proposed [method]... Achieves Dice of X.XX. That template is a hallucination trap.
Use this discipline instead:
- Read the actual paper (abstract + methods + results). For arXiv, use
read_paper. For closed-access, use Zotero MCP to access the user's library.
- Note the actual module names, the actual benchmark, the actual numbers, in your own working notes — not in the manuscript yet.
- Write the method description from those notes, citing specific numbers and module names verbatim from the paper.
- Verify by spot-checking 1-2 of the numbers against the paper one more time before moving on.
If you can't access the paper, do not write about its internal architecture or specific performance numbers. Cite it for the contribution-level claim only ("first to apply X to Y") and move on.
Heading depth — Nature Reviews uses two levels
- H2 (
##) for top-level sections (Introduction, Methods, Applications, Discussion, ...).
- H3 (
###) for subsections.
- H4 (
####) is forbidden in body. Use bold lead-in**Topic.**paragraph starters for deeper grouping.
- Number prefixes (
1.,1.1,1.2.3) are forbidden in section titles. Nature Reviews / Nat Med / Lancet / JACC don't use them in narrative reviews.
Equations — in a Box, not in body
Display equations (DSC, IoU, clDice, FedAvg, GCN propagation, ...) appear in Boxes, not inline in body paragraphs. Textbook formulas can be referenced ("the Dice similarity coefficient — see Box 1") but should not be displayed inline.
If a formula has no methodological insight worth displaying (e.g., FedAvg averaging), describe it in prose instead of showing it.
Vendor names — only in the regulatory/products table
Vendor names (HeartFlow, Cleerly, Caristo, Keya, Shukun, ...) appear ONLY in the Commercial Products / Regulatory & Validation table. In body text use category descriptors:
- ✗ "HeartFlow's CT-FFR product was validated in NXT, ADVANCE, and PACIFIC..."
- ✓ "The first FDA-cleared CT-FFR product (Table N, row 1) was validated in NXT, ADVANCE, and PACIFIC..."
Reason: scatter-cited vendor names look like marketing copy and undermine the review's authority.
Standard Review Structure
# [Title]: <evocative subtitle>
## Key Points
- 4-5 bullets, each 1-3 sentences, expressing the main conclusions.
## Abstract
## Introduction
### Clinical background
### Technical challenge
### Scope and contributions
## Datasets and evaluation metrics
(Table 1: public datasets)
(Box 1: evaluation metrics with equations)
## Methods # 3-axis grouping, NOT flat 10-subsection list
### Architectural priors
**CNN-based design.** ... (bold lead-in for sub-grouping)
**Transformer-based design.** ...
**Mamba and state-space design.** ...
### Inductive priors
**Topology-aware design.** ...
**Multi-task design.** ...
**Graph-based design.** ...
### Data regime
**Self-supervised pre-training.** ...
**Foundation models.** ...
**Federated learning.** ...
**Physics-informed models.** ...
(Table 2: representative methods with modality / family / dataset / metric)
## Downstream applications
### [Application 1]
### [Application 2]
### [Application 3]
## Translation to clinical practice
(Table 3: commercial products with regulatory + validation)
## Outstanding challenges
## Future directions
## References
Notes:
- No number prefixes on headings.
- §Methods is 3 H3 subsections (the three axes), with bold lead-ins for each method family inside.
- Tables 1, 2, 3 are typically enough. Box 1 (metrics) is typical. Avoid 5+ tables.
- Verdict sentences cluster at the end of §Methods axis subsections and at the end of clinical translation discussions — not after every paragraph.
Verdict Sentences
Each H3 method-axis subsection (Architectural priors / Inductive priors / Data regime) should close with one verdict sentence expressing authorial position. Choose the 3-5 most opinionated positions across the whole manuscript — don't put verdicts on every paragraph.
Verdict templates:
- "[Family] is currently the most cost-effective design choice for [problem]."
- "[Family] has yet to demonstrate clear advantage over [alternative] in clinical-grade evaluations."
- "[Family] is best understood as complementary to [alternative], not a replacement."
- "The next [N] years will determine whether [family] becomes the default backbone or remains a research curiosity."
Neutral catalogue is the LLM default and exactly what flagship review editors push back on. Force yourself to take 3-5 positions.
Required Elements
- Key Points box (4-5 bullets, 1-3 sentences each) after the title.
- Tables 1-3: datasets, methods, commercial products.
- Box 1: evaluation metrics with formulas.
- Figures: typically 3-5 (overview/taxonomy, representative architectures, workflow, performance landscape).
- References: cite only what supports the argument. Quantity is downstream of substance — don't pad to a target count.
- Verdict sentences: 3-5 across the whole manuscript, clustered at axis-section ends.
Heading Depth
See [Core Principles ▸ Heading depth](#heading-depth--nature-reviews-uses-two-levels) above. Hard rules:
- Max 2 heading levels in body.
- No number prefixes.
- Use bold lead-in
**Topic.**for deeper subsubsections.
Equations
See [Core Principles ▸ Equations](#equations--in-a-box-not-in-body) above. All display equations go in Box 1 (or rare additional Boxes for specific protocols). Textbook formulas with no methodological insight should be described in prose, not displayed.
Vendor Names
See [Core Principles ▸ Vendor names](#vendor-names--only-in-the-regulatoryproducts-table) above. Vendor names live in Table 3 only; body text uses category descriptors with table cross-reference.
Citation Style
# Data citation
"...achieved Dice of 0.730 on ImageCAS [N]"
# Method citation
"Xu et al. [N] introduced..."
# Multi-citation (max 4 in one bracket — beyond that, regroup the claim)
"Multiple groups demonstrated this effect [N1, N2, N3]"
# Comparative
"While [N1] focused on architecture, [N2] addressed the data side"
[N] in body must match the bibliography entry [N], and bibliography [N] must be the paper the body sentence is actually attributing the claim to. See references/CITATION_INTEGRITY.md Rule 3.
Literature Sources
Use all three in combination:
Source
Best for
Tools
ArXiv
Methodological preprints, ML/AI advances
mcp__arxiv-mcp-server__search_papers, read_paper
PubMed
Peer-reviewed clinical / validation studies
mcp__pubmed-mcp-server__pubmed_search_articles + WebFetch on PubMed
Zotero
User's local library (closed-access journals)
mcp__zotero__zotero_search_items, zotero_get_item_fulltext
Crossref
DOI verification
WebFetch on api.crossref.org/works/<DOI>
For closed-access journals (Med Image Anal, Eur Radiol, Lancet family) the user's local Zotero library is often the only path. Always check Zotero before assuming a paper is inaccessible.
For MCP server configuration, see references/MCP_SETUP.md.
Reference Files
File
Read when
Starting a new review or moving between phases
Phase 0: capturing exemplar review style spec
references/CITATION_INTEGRITY.md
Phase 2 (collection) and Phase 4 (write) — every citation must follow the 5 rules
references/HALLUCINATION_PATTERNS.md
Phase 4 (write) and Phase 5 (peer review) — checklist of 9 LLM hallucination indicators to self-check against
Phase 3 (outline) — 3-axis method groupings per domain
Phase 1 (init) — CLAUDE.md, IMPLEMENTATION_PLAN.md, table templates
references/QUALITY_CHECKLIST.md
Before delivering a draft to the user
Setting up arxiv-mcp / pubmedmcp / zotero-mcp
Related Skills
For revising an existing AI-drafted review (whether your own previous output or someone else's draft), use ai-review-revision. That skill is the dedicated tool for fixing draft-quality issues — multi-agent diagnostic, factual reset, structural reset, content polish, submission prep.
This skill (medical-imaging-review) is the dedicated tool for producing draft-quality content correctly the first time. They are complementary:
- medical-imaging-review = write-side (produce submission-quality first draft)
- ai-review-revision = revise-side (rescue a draft that already has quality issues)
If a draft produced by this skill still ends up needing the ai-review-revision workflow to land, that's a bug — flag it so this skill can be improved.
Why this skill was rewritten (v3 vs v2)
v2.0.0 produced the coronary-cta-paper initial draft. That draft needed extensive multi-day revision before submission-readiness: 17 placeholder DOIs, 30-40 [N] citation drift errors, fabricated method module names with wrong performance numbers, vendor-style citations attributed to peer-reviewed journals, a 10-subsection flat method taxonomy where 3 thematic axes would have served better, AI-tone hedging language throughout.
v3 directly addresses each of these failure modes:
v2 failure
v3 fix
Hedging mandate in Core Principles
Removed; replaced with "match voice to evidence"
80-120 reference count target
Removed; replaced with "cite what supports the argument"
Method fill-in template
Removed; replaced with "read-first, write-after" discipline
10-flat method subsection taxonomy
Replaced with 3-axis grouping in DOMAINS.md
QA = formal structural check
Replaced with per-claim verification embedded in Phase 4
No DOI / author / direction verification
Added as CITATION_INTEGRITY.md with 5 rules
No hallucination self-check
Added as HALLUCINATION_PATTERNS.md (9 patterns)
Numbered headings
Banned; max 2 levels, bold lead-in for deeper
Vendor names scattered
Confined to Table 3 only
Equations inline
Confined to Box 1 only
Verdict-free neutral catalogue
Required 3-5 verdict sentences
No exemplar paradigm capture
Added Phase 0 PARADIGM.md