openai-whisper

Local speech-to-text transcription using OpenAI's Whisper CLI, with no API keys.

  • Transcribes audio files (MP3, M4A, and other formats) directly on your machine, with no external API calls.
  • Supports multiple model sizes (tiny, base, small, medium, large, turbo); models are cached in ~/.cache/whisper on first run.
  • Offers transcription and translation tasks with configurable output formats (TXT, SRT, JSON, VTT).
  • Requires only the whisper CLI binary, installable via Homebrew or pip.

INSTALLATION
npx skills add https://github.com/steipete/clawdis --skill openai-whisper
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

Whisper (CLI)

Use whisper to transcribe audio locally.

Quick start

  • whisper /path/audio.mp3 --model medium --output_format txt --output_dir .
  • whisper /path/audio.m4a --task translate --output_format srt
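
If the quick-start commands need to be driven from a script, one option is to assemble the argument list in Python and hand it to subprocess. The following is a minimal sketch, not part of the skill itself: the helper names build_whisper_command and run_whisper are hypothetical, and only the flags already shown above (--model, --task, --output_format, --output_dir) are used. Options left unset are omitted so the CLI's own defaults apply.

```python
import shutil
import subprocess
from typing import List, Optional


def build_whisper_command(
    audio_path: str,
    model: Optional[str] = None,
    task: Optional[str] = None,
    output_format: Optional[str] = None,
    output_dir: Optional[str] = None,
) -> List[str]:
    """Assemble the argv for a whisper CLI call (hypothetical helper).

    Any option left as None is not passed, so the CLI default is used.
    """
    cmd = ["whisper", audio_path]
    if model is not None:
        cmd += ["--model", model]
    if task is not None:
        cmd += ["--task", task]
    if output_format is not None:
        cmd += ["--output_format", output_format]
    if output_dir is not None:
        cmd += ["--output_dir", output_dir]
    return cmd


def run_whisper(cmd: List[str]) -> None:
    """Run the command, failing early if the whisper binary is not on PATH."""
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError("whisper CLI not found on PATH")
    # check=True raises CalledProcessError on a non-zero exit code.
    subprocess.run(cmd, check=True)
```

Calling build_whisper_command("/path/audio.mp3", model="medium", output_format="txt", output_dir=".") reproduces the first quick-start command as a list. Passing a list rather than a shell string avoids quoting problems with file names that contain spaces.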

Notes

  • Models download to ~/.cache/whisper on first run.
  • --model defaults to turbo on this install.
  • Use smaller models for speed, larger for accuracy.
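
To see which models have already been downloaded, the cache directory mentioned above can be inspected directly. This is a rough sketch under one assumption: that each cached model is stored as a single file with a .pt extension named after the model. The helper list_cached_models is hypothetical and not part of the whisper CLI.

```python
from pathlib import Path
from typing import List

# Cache location taken from the note above.
DEFAULT_CACHE_DIR = Path.home() / ".cache" / "whisper"


def list_cached_models(cache_dir: Path = DEFAULT_CACHE_DIR) -> List[str]:
    """Return the sorted names of cached model files (hypothetical helper).

    Assumes models are stored as <name>.pt; returns an empty list when
    the directory does not exist yet, e.g. before the first run.
    """
    if not cache_dir.is_dir():
        return []
    return sorted(p.stem for p in cache_dir.glob("*.pt"))
```

An empty result suggests that the next whisper run will download its model before transcribing.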