imagen

|

INSTALLATION
npx skills add https://github.com/sanjay3290/ai-skills --skill imagen
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

Imagen - AI Image Generation Skill

Overview

This skill generates images using Google Gemini's image generation model (gemini-3-pro-image-preview). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts.

Cross-Platform: Works on Windows, macOS, and Linux.

When to Use This Skill

Automatically activate this skill when:

  • User requests image generation (e.g., "generate an image of...", "create a picture...")
  • Frontend development requires placeholder or actual images
  • Documentation needs illustrations or diagrams
  • Visualizing concepts, architectures, or ideas
  • Creating icons, logos, or UI assets
  • Any task where an AI-generated image would be helpful

How It Works

  • Takes a text prompt describing the desired image
  • Calls Google Gemini API with image generation configuration
  • Saves the generated image to a specified location (defaults to current directory)
  • Returns the file path for use in your project

Usage

Python (Cross-Platform - Recommended)

# Basic usage

python scripts/generate_image.py "A futuristic city skyline at sunset"

# With custom output path

python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png"

# With custom size

python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png"

Requirements

  • GEMINI_API_KEY environment variable must be set
  • Python 3.6+ (uses standard library only, no pip install needed)

Output

Generated images are saved as PNG files. The script returns:

  • Success: Path to the generated image
  • Failure: Error message with details

Examples

Frontend Development

User: "I need a hero image for my landing page - something abstract and tech-focused"

-> Generates and saves image, provides path for use in HTML/CSS

Documentation

User: "Create a diagram showing microservices architecture"

-> Generates visual representation, ready for README or docs

UI Assets

User: "Generate a placeholder avatar image for the user profile component"

-> Creates image in appropriate size for component use
BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card