elevenlabs-music-generation

>

INSTALLATION
npx skills add https://github.com/doany-ai/skills --skill elevenlabs-music-generation
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

$2b

2. Sign in

runcomfy login # or in CI: export RUNCOMFY_TOKEN=

3. Generate music

runcomfy run elevenlabs/elevenlabs/music-generation

--input '{"prompt": "..."}'

--output-dir ./out

CLI deep dive: [`runcomfy-cli`](https://www.skills.sh/agentspace-so/runcomfy-agent-skills/runcomfy-cli) skill.

## When to use ElevenLabs Music

ElevenLabs Music's strength is **structured songs with real vocals** — it takes a style brief plus lyrics with section markers and returns a coherent, mixed track. Pick it for:

- **Full vocal songs** — verse/chorus structure, multilingual lyrics, consistent meter

- **Instrumental beds** — `force_instrumental: true` for background music, podcast intros, game loops

- **Short brand assets** — jingles, stingers, theme music (5–30 s)

- **Long-form tracks** — up to 5 minutes in a single call

- **Commercial work** — output is commercial-friendly

If the user just wants ambient sound or a one-off SFX (thunder, footsteps), that's a sound-effects task, not music — ElevenLabs Music is for *songs and tracks*.

## Endpoint + input schema

**Model**: `elevenlabs/elevenlabs/music-generation`

| Field | Type | Required | Default | Notes |

|---|---|---|---|---|

| `prompt` | string | yes | — | Style description **and** lyrics with section markers. See prompting tips |

| `music_length_ms` | int | no | `40000` | Output duration in ms. **5000–300000** (5 s – 5 min) |

| `force_instrumental` | bool | no | `false` | `true` = instrumental only, no vocals |

| `output_format` | string | no | `mp3_standard` | `mp3_standard` (default), or WAV — see the [model page](https://www.runcomfy.com/models/elevenlabs/elevenlabs/music-generation?utm_source=skills.sh&utm_medium=skill&utm_campaign=elevenlabs-music-generation) API tab for the full format list |

Output: 44.1 kHz stereo audio. The result JSON contains the generated audio URL — the CLI downloads it into `--output-dir`.

**Pricing**: ~$0.0083 per second of generated audio (30 s ≈ $0.25, 60 s ≈ $0.50, 5 min ≈ $2.49). Cost scales with `music_length_ms`, so draft short and finalize long.

## How to invoke

**Full vocal song with structure:**

runcomfy run elevenlabs/elevenlabs/music-generation \

--input '{

"prompt": "Upbeat indie-pop anthem, bright electric guitars, driving drums, 120 BPM, female lead vocal. [Intro 8 bars] instrumental build. [Verse] Chalk on the palms, laces double-knotted, morning on the ridge. [Chorus] We rise, we strike, we never fade out. [Bridge] soft breakdown, just piano and voice. [Outro] full band, fade.",

"music_length_ms": 60000

}' \

--output-dir ./out


**Instrumental background bed:**

runcomfy run elevenlabs/elevenlabs/music-generation \

--input '{

"prompt": "Calm lo-fi hip-hop instrumental for a study playlist. Warm Rhodes piano, soft vinyl crackle, mellow boom-bap drums, 75 BPM. No vocals. Consistent loop-friendly groove throughout.",

"music_length_ms": 90000,

"force_instrumental": true

}' \

--output-dir ./out


**Short brand jingle:**

runcomfy run elevenlabs/elevenlabs/music-generation \

--input '{

"prompt": "5-second cheerful brand stinger, bright marimba and a single uplifting chord resolve, no vocals.",

"music_length_ms": 5000,

"force_instrumental": true

}' \

--output-dir ./out

BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card