ai-content-pipeline

Build multi-step AI content creation pipelines combining image, video, audio, and text. Workflow examples: generate image -> animate -> add voiceover -> merge…

INSTALLATION
npx skills add https://github.com/inference-sh/skills --skill ai-content-pipeline
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

$28

Pipeline Patterns

Pattern 1: Image -> Video -> Audio

[FLUX Image] -> [Wan 2.5 Video] -> [Foley Sound]

Pattern 2: Script -> Speech -> Avatar

[LLM Script] -> [Kokoro TTS] -> [OmniHuman Avatar]

Pattern 3: Research -> Content -> Distribution

[Tavily Search] -> [Claude Summary] -> [FLUX Visual] -> [Twitter Post]

Complete Workflows

YouTube Short Pipeline

Create a complete short-form video from a topic.

# 1. Generate script with Claude

belt app run openrouter/claude-sonnet-45 --input '{

  "prompt": "Write a 30-second script about the future of AI. Make it engaging and conversational. Just the script, no stage directions."

}' > script.json

# 2. Generate voiceover with Kokoro

belt app run infsh/kokoro-tts --input '{

  "prompt": "<script-text>",

  "voice": "af_sarah"

}' > voice.json

# 3. Generate background image with FLUX

belt app run falai/flux-dev --input '{

  "prompt": "Futuristic city skyline at sunset, cyberpunk aesthetic, 4K wallpaper"

}' > background.json

# 4. Animate image to video with Wan

belt app run falai/wan-2-5 --input '{

  "image_url": "<background-url>",

  "prompt": "slow camera pan across cityscape, subtle movement"

}' > video.json

# 5. Add captions (manually or with another tool)

# 6. Merge video with audio

belt app run infsh/media-merger --input '{

  "video_url": "<video-url>",

  "audio_url": "<voice-url>"

}'

Talking Head Video Pipeline

Create an AI avatar presenting content.

# 1. Write the script

belt app run openrouter/claude-sonnet-45 --input '{

  "prompt": "Write a 1-minute explainer script about quantum computing for beginners."

}' > script.json

# 2. Generate speech

belt app run infsh/kokoro-tts --input '{

  "prompt": "<script>",

  "voice": "am_michael"

}' > speech.json

# 3. Generate or use a portrait image

belt app run falai/flux-dev --input '{

  "prompt": "Professional headshot of a friendly tech presenter, neutral background, looking at camera"

}' > portrait.json

# 4. Create talking head video

belt app run bytedance/omnihuman-1-5 --input '{

  "image_url": "<portrait-url>",

  "audio_url": "<speech-url>"

}' > talking_head.json

Product Demo Pipeline

Create a product showcase video.

# 1. Generate product image

belt app run falai/flux-dev --input '{

  "prompt": "Sleek wireless earbuds on white surface, studio lighting, product photography"

}' > product.json

# 2. Animate product reveal

belt app run falai/wan-2-5 --input '{

  "image_url": "<product-url>",

  "prompt": "slow 360 rotation, smooth motion"

}' > product_video.json

# 3. Upscale video quality

belt app run falai/topaz-video-upscaler --input '{

  "video_url": "<product-video-url>"

}' > upscaled.json

# 4. Add background music

belt app run infsh/media-merger --input '{

  "video_url": "<upscaled-url>",

  "audio_url": "https://your-music.mp3",

  "audio_volume": 0.3

}'

Blog to Video Pipeline

Convert written content to video format.

# 1. Summarize blog post

belt app run openrouter/claude-haiku-45 --input '{

  "prompt": "Summarize this blog post into 5 key points for a video script: <blog-content>"

}' > summary.json

# 2. Generate images for each point

for i in 1 2 3 4 5; do

  belt app run falai/flux-dev --input "{

    \"prompt\": \"Visual representing point $i: <point-text>\"

  }" > "image_$i.json"

done

# 3. Animate each image

for i in 1 2 3 4 5; do

  belt app run falai/wan-2-5 --input "{

    \"image_url\": \"<image-$i-url>\"

  }" > "video_$i.json"

done

# 4. Generate voiceover

belt app run infsh/kokoro-tts --input '{

  "prompt": "<full-script>",

  "voice": "bf_emma"

}' > narration.json

# 5. Merge all clips

belt app run infsh/media-merger --input '{

  "videos": ["<video1>", "<video2>", "<video3>", "<video4>", "<video5>"],

  "audio_url": "<narration-url>",

  "transition": "crossfade"

}'

Pipeline Building Blocks

Content Generation

Step

App

Purpose

Script

openrouter/claude-sonnet-45

Write content

Research

tavily/search-assistant

Gather information

Summary

openrouter/claude-haiku-45

Condense content

Visual Assets

Step

App

Purpose

Image

falai/flux-dev

Generate images

Image

google/imagen-3

Alternative image gen

Upscale

falai/topaz-image-upscaler

Enhance quality

Animation

Step

App

Purpose

I2V

falai/wan-2-5

Animate images

T2V

google/veo-3-1-fast

Generate from text

Avatar

bytedance/omnihuman-1-5

Talking heads

Audio

Step

App

Purpose

TTS

infsh/kokoro-tts

Voice narration

Music

infsh/ai-music

Background music

Foley

infsh/hunyuanvideo-foley

Sound effects

Post-Production

Step

App

Purpose

Upscale

falai/topaz-video-upscaler

Enhance video

Merge

infsh/media-merger

Combine media

Caption

infsh/caption-video

Add subtitles

Best Practices

  • Plan the pipeline first - Map out each step before running
  • Save intermediate results - Store outputs for iteration
  • Use appropriate quality - Fast models for drafts, quality for finals
  • Match resolutions - Keep consistent aspect ratios throughout
  • Test each step - Verify outputs before proceeding

Related Skills

# Video generation models

npx skills add inference-sh/skills@ai-video-generation

# Image generation

npx skills add inference-sh/skills@ai-image-generation

# Text-to-speech

npx skills add inference-sh/skills@text-to-speech

# LLM models for scripts

npx skills add inference-sh/skills@llm-models

# Full platform skill

npx skills add inference-sh/skills@infsh-cli

Browse all apps: belt app store

Documentation

BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card