ai-video-generation

Generate videos with 40+ AI models including Veo, Seedance, Wan, and Grok via inference.sh CLI. Supports text-to-video, image-to-video, avatar animation, lipsync, video upscaling, and foley sound generation across multiple model families Access to Google Veo (3.1, 3, 2), ByteDance Seedance, Falai Wan, xAI Grok, and economical alternatives like P-Video and WAN models Includes utility tools for video upscaling, sound effect generation, and multi-clip merging with transitions Requires inference.sh CLI ( infsh ) installation and authentication to run video generation commands

INSTALLATION
npx skills add https://github.com/inference-sh/skills --skill ai-video-generation
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

$2c

Available Models

Text-to-Video

Model

App ID

Best For

Veo 3.1 Fast

google/veo-3-1-fast

Fast, with optional audio

Veo 3.1

google/veo-3-1

Best quality, frame interpolation

Veo 3

google/veo-3

High quality with audio

Veo 3 Fast

google/veo-3-fast

Fast with audio

Veo 2

google/veo-2

Realistic videos

P-Video

pruna/p-video

Fast, economical, with audio support

WAN-T2V

pruna/wan-t2v

Economical 480p/720p

Grok Video

xai/grok-imagine-video

xAI, configurable duration

Seedance 2.0

bytedance/seedance-2-0

Text/image/ref-to-video with sync audio, up to 1080p

Seedance 2.0 Fast

bytedance/seedance-2-0-fast

Fast variant, same capabilities

HappyHorse T2V

alibaba/happyhorse-1-0-t2v

Physically realistic, up to 15s

Image-to-Video

Model

App ID

Best For

Wan 2.5

falai/wan-2-5

Animate any image

Wan 2.5 I2V

falai/wan-2-5-i2v

High quality i2v

WAN-I2V

pruna/wan-i2v

Economical 480p/720p

P-Video

pruna/p-video

Fast i2v with audio

Seedance 2.0

bytedance/seedance-2-0

Animate images with sync audio, up to 1080p

Seedance 2.0 Fast

bytedance/seedance-2-0-fast

Fast variant, same capabilities

HappyHorse I2V

alibaba/happyhorse-1-0-i2v

Animate images, up to 1080P/15s

HappyHorse R2V

alibaba/happyhorse-1-0-r2v

Character-preserving from references

Avatar / Lipsync

Model

App ID

Best For

OmniHuman 1.5

bytedance/omnihuman-1-5

Multi-character

OmniHuman 1.0

bytedance/omnihuman-1-0

Single character

Fabric 1.0

falai/fabric-1-0

Image talks with lipsync

PixVerse Lipsync

falai/pixverse-lipsync

Realistic lipsync

Video Editing

Model

App ID

Best For

HappyHorse Edit

alibaba/happyhorse-1-0-video-edit

Natural language video editing

Utilities

Tool

App ID

Description

HunyuanVideo Foley

infsh/hunyuanvideo-foley

Add sound effects to video

Topaz Upscaler

falai/topaz-video-upscaler

Upscale video quality

Media Merger

infsh/media-merger

Merge videos with transitions

Browse All Video Apps

belt app store --category video

Examples

Text-to-Video with Veo

belt app run google/veo-3-1-fast --input '{

  "prompt": "A timelapse of a flower blooming in a garden"

}'

Grok Video

belt app run xai/grok-imagine-video --input '{

  "prompt": "Waves crashing on a beach at sunset",

  "duration": 5

}'

Image-to-Video with Wan 2.5

belt app run falai/wan-2-5 --input '{

  "image_url": "https://your-image.jpg"

}'

AI Avatar / Talking Head

belt app run bytedance/omnihuman-1-5 --input '{

  "image_url": "https://portrait.jpg",

  "audio_url": "https://speech.mp3"

}'

Fabric Lipsync

belt app run falai/fabric-1-0 --input '{

  "image_url": "https://face.jpg",

  "audio_url": "https://audio.mp3"

}'

Seedance 2.0 Text-to-Video with Audio

belt app run bytedance/seedance-2-0 --input '{

  "prompt": "a jazz band performing in a dimly lit club",

  "generate_audio": true,

  "duration": 10

}'

Seedance 2.0 Image-to-Video

belt app run bytedance/seedance-2-0 --input '{

  "image": "https://your-image.jpg",

  "prompt": "gentle camera movement, leaves rustling in the wind",

  "generate_audio": true

}'

Seedance 2.0 Reference-to-Video

belt app run bytedance/seedance-2-0 --input '{

  "prompt": "A person who looks like the reference walking through a garden",

  "reference_image": "https://portrait.jpg",

  "generate_audio": true

}'

HappyHorse Text-to-Video

belt app run alibaba/happyhorse-1-0-t2v --input '{

  "prompt": "a golden retriever running through autumn leaves, slow motion",

  "duration": 10,

  "resolution": "1080P"

}'

HappyHorse Video Editing

belt app run alibaba/happyhorse-1-0-video-edit --input '{

  "video": "https://your-video.mp4",

  "prompt": "change the background to a snowy mountain landscape"

}'

PixVerse Lipsync

belt app run falai/pixverse-lipsync --input '{

  "image_url": "https://portrait.jpg",

  "audio_url": "https://speech.mp3"

}'

Video Upscaling

belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'

Add Sound Effects (Foley)

belt app run infsh/hunyuanvideo-foley --input '{

  "video_url": "https://silent-video.mp4",

  "prompt": "footsteps on gravel, birds chirping"

}'

Merge Videos

belt app run infsh/media-merger --input '{

  "videos": ["https://clip1.mp4", "https://clip2.mp4"],

  "transition": "fade"

}'

Related Skills

# Full platform skill (all 250+ apps)

npx skills add inference-sh/skills@infsh-cli

# Pruna P-Video (fast & economical)

npx skills add inference-sh/skills@p-video

# Google Veo specific

npx skills add inference-sh/skills@google-veo

# Seedance 2.0

npx skills add inference-sh/skills@seedance

# HappyHorse 1.0

npx skills add inference-sh/skills@happyhorse

# AI avatars & lipsync

npx skills add inference-sh/skills@ai-avatar-video

# Text-to-speech (for video narration)

npx skills add inference-sh/skills@text-to-speech

# Image generation (for image-to-video)

npx skills add inference-sh/skills@ai-image-generation

# Twitter (post videos)

npx skills add inference-sh/skills@twitter-automation

Browse all apps: belt app store

Documentation

BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card