zai-tts

High-quality text-to-speech audio generation using GLM-TTS with customizable voices and playback parameters. Converts text or file content to WAV audio via the uvx zai-tts command with configurable output paths Supports three built-in system voices (Lila, Chloe, Ethan) plus custom cloned voices created on audio.z.ai Adjustable speed (1.5x, etc.) and volume (2x, etc.) parameters for fine-tuned audio output Requires environment variables ZAI_AUDIO_USERID and ZAI_AUDIO_TOKEN obtained from audio.z.ai console authentication

INSTALLATION
npx skills add https://github.com/aahl/skills --skill zai-tts
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

$2a

Changing the voice

uvx zai-tts -t "{msg}" -o {tempdir}/{filename}.wav --voice system_002

Available voices

system_001: Lila. A cheerful, standard-pronunciation female voice

system_002: Chloe. A gentle, elegant, intelligent female voice

system_003: Ethan. A sunny, standard-pronunciation male voice

Retrieve all available voices using shell commands:

uvx zai-tts -l

If you want to use custom voices, please complete voice cloning on the website audio.z.ai first.

BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card