SKILL.md

$27

Verify CLI is available

researchclaw --help

**Requirements:** Python 3.11+

---

## Configuration

cp config.researchclaw.example.yaml config.arc.yaml


### Minimum config ( config.arc.yaml )

project:

research:

topic: "Your research topic here"

llm:

provider: "openai"

base_url: "https://api.openai.com/v1"

api_key_env: "OPENAI_API_KEY"

primary_model: "gpt-4o"

fallback_models: ["gpt-4o-mini"]

experiment:

mode: "sandbox"

sandbox:

python_path: ".venv/bin/python"

export OPENAI_API_KEY="$YOUR_OPENAI_KEY"


### OpenRouter config (200+ models)

llm:

provider: "openrouter"

api_key_env: "OPENROUTER_API_KEY"

primary_model: "anthropic/claude-3.5-sonnet"

fallback_models:

- "google/gemini-pro-1.5"

- "meta-llama/llama-3.1-70b-instruct"

export OPENROUTER_API_KEY="$YOUR_OPENROUTER_KEY"


### ACP (Agent Client Protocol) — no API key needed

llm:

provider: "acp"

acp:

agent: "claude" # or: codex, gemini, opencode, kimi

cwd: "."


The agent CLI (e.g. `claude`) handles its own authentication.

### OpenClaw bridge (optional advanced capabilities)

openclaw_bridge:

use_cron: true # Scheduled research runs

use_message: true # Progress notifications

use_memory: true # Cross-session knowledge persistence

use_sessions_spawn: true # Parallel sub-sessions

use_web_fetch: true # Live web search in literature review

use_browser: false # Browser-based paper collection


## Key CLI Commands

Basic run — fully autonomous, no prompts

researchclaw run --topic "Your research idea" --auto-approve

Run with explicit config file

researchclaw run --config config.arc.yaml --topic "Mixture-of-experts routing efficiency" --auto-approve

Run with topic defined in config (omit --topic flag)

researchclaw run --config config.arc.yaml --auto-approve

Interactive mode — pauses at gate stages for approval

researchclaw run --config config.arc.yaml --topic "Your topic"

Check pipeline status / resume a run

researchclaw status --run-id rc-20260315-120000-abc123

List past runs

researchclaw list


**Gate stages** (5, 9, 20) pause for human approval in interactive mode. Pass `--auto-approve` to skip all gates.

## Python API

from researchclaw.pipeline import Runner

from researchclaw.config import load_config

Load config and run

config = load_config("config.arc.yaml")

config.research.topic = "Efficient attention mechanisms for long-context LLMs"

config.auto_approve = True

runner = Runner(config)

result = runner.run()

Access outputs

print(result.artifact_dir) # artifacts/rc-YYYYMMDD-HHMMSS-<hash>/

print(result.deliverables_dir) # .../deliverables/

print(result.paper_draft_path) # .../deliverables/paper_draft.md

print(result.latex_path) # .../deliverables/paper.tex

print(result.bibtex_path) # .../deliverables/references.bib

print(result.verification_report) # .../deliverables/verification_report.json

Run specific stages only

from researchclaw.pipeline import Runner, StageRange

runner = Runner(config)

result = runner.run(stages=StageRange(start="LITERATURE_COLLECT", end="KNOWLEDGE_EXTRACT"))

Access knowledge base after a run

from researchclaw.knowledge import KnowledgeBase

kb = KnowledgeBase.load(result.artifact_dir)

findings = kb.get("findings")

literature = kb.get("literature")

decisions = kb.get("decisions")


## Output Structure

After a run, all outputs land in `artifacts/rc-YYYYMMDD-HHMMSS-<hash>/`:

artifacts/rc-20260315-120000-abc123/

├── deliverables/

│ ├── paper_draft.md # Full academic paper (Markdown)

│ ├── paper.tex # Conference-ready LaTeX

│ ├── references.bib # Real BibTeX — auto-pruned to inline citations

│ ├── verification_report.json # 4-layer citation integrity report

│ └── reviews.md # Multi-agent peer review

├── experiment_runs/

│ ├── run_001/

│ │ ├── code/ # Generated experiment code

│ │ ├── results.json # Structured metrics

│ │ └── sandbox_output.txt # Execution logs

├── charts/

│ └── *.png # Auto-generated comparison charts

├── evolution/

│ └── lessons.json # Self-learning lessons for future runs

└── knowledge_base/

├── decisions.json

├── experiments.json

├── findings.json

├── literature.json

├── questions.json

└── reviews.json


## Pipeline Stages Reference

Phase
Stage #
Name
Notes

A
1
TOPIC_INIT
Parse and scope research topic

A
2
PROBLEM_DECOMPOSE
Break into sub-problems

B
3
SEARCH_STRATEGY
Build search queries

B
4
LITERATURE_COLLECT
Real API calls to arXiv + Semantic Scholar

B
5
LITERATURE_SCREEN
**Gate** — approve/reject literature

B
6
KNOWLEDGE_EXTRACT
Extract structured knowledge

C
7
SYNTHESIS
Synthesize findings

C
8
HYPOTHESIS_GEN
Multi-agent debate to form hypotheses

D
9
EXPERIMENT_DESIGN
**Gate** — approve/reject design

D
10
CODE_GENERATION
Generate experiment code

D
11
RESOURCE_PLANNING
GPU/MPS/CPU auto-detection

E
12
EXPERIMENT_RUN
Sandboxed execution

E
13
ITERATIVE_REFINE
Self-healing on failure

F
14
RESULT_ANALYSIS
Multi-agent analysis

F
15
RESEARCH_DECISION
PROCEED / REFINE / PIVOT

G
16
PAPER_OUTLINE
Structure paper

G
17
PAPER_DRAFT
Write full paper

G
18
PEER_REVIEW
Evidence-consistency check

G
19
PAPER_REVISION
Incorporate review feedback

H
20
QUALITY_GATE
**Gate** — final approval

H
21
KNOWLEDGE_ARCHIVE
Save lessons to KB

H
22
EXPORT_PUBLISH
Emit LaTeX + BibTeX

H
23
CITATION_VERIFY
4-layer anti-hallucination check

## Common Patterns

### Pattern: Quick paper on a topic

export OPENAI_API_KEY="$OPENAI_API_KEY"

researchclaw run \

--topic "Self-supervised learning for protein structure prediction" \

--auto-approve


### Pattern: Reproducible run with full config

config.arc.yaml

project:

research:

topic: "Self-supervised learning for protein structure prediction"

llm:

provider: "openai"

api_key_env: "OPENAI_API_KEY"

primary_model: "gpt-4o"

fallback_models: ["gpt-4o-mini"]

experiment:

mode: "sandbox"

sandbox:

python_path: ".venv/bin/python"

max_iterations: 3

timeout_seconds: 300

researchclaw run --config config.arc.yaml --auto-approve


### Pattern: Use Claude via OpenRouter for best reasoning

export OPENROUTER_API_KEY="$OPENROUTER_API_KEY"

cat > config.arc.yaml << 'EOF'

project:

llm:

provider: "openrouter"

api_key_env: "OPENROUTER_API_KEY"

primary_model: "anthropic/claude-3.5-sonnet"

fallback_models: ["google/gemini-pro-1.5"]

experiment:

mode: "sandbox"

sandbox:

python_path: ".venv/bin/python"

EOF

researchclaw run --config config.arc.yaml \

--topic "Efficient KV cache compression for transformer inference" \

--auto-approve


### Pattern: Resume after a failed run

List runs to find the run ID

researchclaw list

Resume from last completed stage

researchclaw run --resume rc-20260315-120000-abc123


### Pattern: Programmatic batch research

import asyncio

from researchclaw.pipeline import Runner

from researchclaw.config import load_config

topics = [

"LoRA fine-tuning on limited hardware",

"Speculative decoding for LLM inference",

"Flash attention variants comparison",

]

config = load_config("config.arc.yaml")

config.auto_approve = True

for topic in topics:

config.research.topic = topic

runner = Runner(config)

result = runner.run()

print(f"[{topic}] → {result.deliverables_dir}")


### Pattern: OpenClaw one-liner (if using OpenClaw agent)

Share the repo URL with OpenClaw, then say:

"Research mixture-of-experts routing efficiency"


OpenClaw auto-reads `RESEARCHCLAW_AGENTS.md`, clones, installs, configures, and runs the full pipeline.

## Compile the LaTeX Output

Navigate to deliverables

cd artifacts/rc-*/deliverables/

Compile (requires a LaTeX distribution)

pdflatex paper.tex

bibtex paper

pdflatex paper.tex

Or upload paper.tex + references.bib directly to Overleaf


## Troubleshooting

### researchclaw: command not found

Make sure the venv is active and package is installed

source .venv/bin/activate

pip install -e .

which researchclaw


### API key errors

Verify env var is set

echo $OPENAI_API_KEY

Should print your key (not empty)

Set it explicitly for the session

export OPENAI_API_KEY="sk-..."


### Experiment sandbox failures

The pipeline self-heals at Stage 13 (ITERATIVE_REFINE). If it keeps failing:

Increase timeout and iterations in config

experiment:

max_iterations: 5

timeout_seconds: 600

sandbox:

python_path: ".venv/bin/python"


### Citation hallucination warnings

Stage 23 (CITATION_VERIFY) runs a 4-layer check. If references are pruned:

- This is **expected behaviour** — fake citations are removed automatically

- Check `verification_report.json` for details on which citations were rejected and why

### PIVOT loop running indefinitely

Stage 15 (RESEARCH_DECISION) may pivot multiple times. To cap iterations:

research:

max_pivots: 2

max_refines: 3


### LaTeX compilation errors

Check for missing packages

pdflatex paper.tex 2>&1 | grep "File.*not found"

Install missing packages (TeX Live)

tlmgr install <package-name>


### Out of memory during experiments

Force CPU mode in config

experiment:

sandbox:

device: "cpu"

max_memory_gb: 4

autoresearchclaw-autonomous-research

SKILL.md

Verify CLI is available

Basic run — fully autonomous, no prompts

Run with explicit config file

Run with topic defined in config (omit --topic flag)

Interactive mode — pauses at gate stages for approval

Check pipeline status / resume a run

List past runs

Load config and run

Access outputs

Run specific stages only

Access knowledge base after a run

config.arc.yaml

List runs to find the run ID

Resume from last completed stage

Navigate to deliverables

Compile (requires a LaTeX distribution)

Or upload paper.tex + references.bib directly to Overleaf

Make sure the venv is active and package is installed

Verify env var is set

Should print your key (not empty)

Set it explicitly for the session

Increase timeout and iterations in config

Check for missing packages

Install missing packages (TeX Live)

Force CPU mode in config

Stop writing automation&scrapers

autoresearchclaw-autonomous-research

SKILL.md

Verify CLI is available

Basic run — fully autonomous, no prompts

Run with explicit config file

Run with topic defined in config (omit --topic flag)

Interactive mode — pauses at gate stages for approval

Check pipeline status / resume a run

List past runs

Load config and run

Access outputs

Run specific stages only

Access knowledge base after a run

config.arc.yaml

List runs to find the run ID

Resume from last completed stage

Navigate to deliverables

Compile (requires a LaTeX distribution)

Or upload paper.tex + references.bib directly to Overleaf

Make sure the venv is active and package is installed

Verify env var is set

Should print your key (not empty)

Set it explicitly for the session

Increase timeout and iterations in config

Check for missing packages

Install missing packages (TeX Live)

Force CPU mode in config

Let your agent run on any real-world website

Related skills

Stop writing automation&scrapers