firecrawl-scrape

Extract clean markdown from any URL, including JavaScript-rendered single-page applications. Handles both static pages and JS-rendered SPAs with configurable wait times for rendering Supports multiple concurrent URL scraping with output format options including markdown, HTML, links, and screenshots Includes content filtering options like main-content-only mode to strip navigation and footers, plus tag inclusion/exclusion Optional inline question answering via --query flag for targeted extraction without saving full page content

INSTALLATION
npx skills add https://github.com/firecrawl/cli --skill firecrawl-scrape
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

$2a

Wait for JS to render, then scrape

firecrawl scrape "" --wait-for 3000 -o .firecrawl/page.md

Multiple URLs (each saved to .firecrawl/)

firecrawl scrape https://example.com https://example.com/blog https://example.com/docs

Get markdown and links together

firecrawl scrape "" --format markdown,links -o .firecrawl/page.json

Ask a question about the page

firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"

## Options

| Option                   | Description                                                      |

| ------------------------ | ---------------------------------------------------------------- |

| `-f, --format <formats>` | Output formats: markdown, html, rawHtml, links, screenshot, json |

| `-Q, --query <prompt>`   | Ask a question about the page content (5 credits)                |

| `-H`                     | Include HTTP headers in output                                   |

| `--only-main-content`    | Strip nav, footer, sidebar — main content only                   |

| `--wait-for <ms>`        | Wait for JS rendering before scraping                            |

| `--include-tags <tags>`  | Only include these HTML tags                                     |

| `--exclude-tags <tags>`  | Exclude these HTML tags                                          |

| `-o, --output <path>`    | Output file path                                                 |

## Tips

- **Prefer plain scrape over `--query`.** Scrape to a file, then use `grep`, `head`, or read the markdown directly — you can search and reason over the full content yourself. Use `--query` only when you want a single targeted answer without saving the page (costs 5 extra credits).

- **Try scrape before interact.** Scrape handles static pages and JS-rendered SPAs. Only escalate to `interact` when you need interaction (clicks, form fills, pagination).

- Multiple URLs are scraped concurrently — check `firecrawl --status` for your concurrency limit.

- Single format outputs raw content. Multiple formats (e.g., `--format markdown,links`) output JSON.

- Always quote URLs — shell interprets `?` and `&#x26;` as special characters.

- Naming convention: `.firecrawl/{site}-{path}.md`

## See also

- [firecrawl-search](../firecrawl-search/SKILL.md) — find pages when you don't have a URL

- [firecrawl-interact](../firecrawl-interact/SKILL.md) — when scrape can't get the content, use `interact` to click, fill forms, etc.

- [firecrawl-download](../firecrawl-download/SKILL.md) — bulk download an entire site to local files
BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card