nano-banana

Generate images with Google Gemini native image models via the inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image,…

INSTALLATION
npx skills add https://github.com/inference-sh/skills --skill nano-banana
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

Models

| Model | App ID | Speed | Quality |
|---|---|---|---|
| Gemini 3 Pro Image | google/gemini-3-pro-image-preview | Slower | Best |
| Gemini 2.5 Flash Image | google/gemini-2-5-flash-image | Fast | Excellent |

Search Gemini Image Apps

belt app store search "gemini image"

Examples

Basic Text-to-Image

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "A futuristic cityscape at sunset with flying cars"
}'
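
Inline JSON breaks as soon as the prompt itself contains a double quote or a backslash. The following is a minimal sketch of one way to guard against that, not part of the skill itself. `make_input` is a hypothetical helper name, and it escapes only backslashes and double quotes; newlines and other control characters are not handled.

```shell
# Hypothetical helper: wrap a prompt in a minimal JSON object.
# Assumption: the prompt has no newlines or control characters.
make_input() {
  # Escape backslashes first, then double quotes.
  escaped=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
  printf '{"prompt": "%s"}\n' "$escaped"
}

make_input 'A neon sign that says "OPEN"'

# Possible use with the CLI (not executed here):
# belt app run google/gemini-3-pro-image-preview --input "$(make_input 'A neon sign that says "OPEN"')"
```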

Multiple Images

belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Minimalist logo design for a coffee shop",
  "num_images": 4
}'

Custom Aspect Ratio

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Panoramic mountain landscape with northern lights",
  "aspect_ratio": "16:9"
}'

Image Editing (with input image)

belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Add a rainbow in the sky",
  "images": ["https://example.com/landscape.jpg"]
}'
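
When several reference images are involved, it can help to build the `images` array programmatically and check it against the 14-image limit stated under Input Options. The sketch below is only an illustration: `images_json` is a hypothetical helper, and it assumes the URLs contain no double quotes or backslashes.

```shell
# Hypothetical helper: turn a list of URLs into a JSON array for "images".
# Refuses more than 14 entries, the limit stated under Input Options.
images_json() {
  if [ "$#" -gt 14 ]; then
    echo "too many images: $# (limit is 14)" >&2
    return 1
  fi
  out=""
  for url in "$@"; do
    # Append with a separating comma once the list is non-empty.
    out="${out:+$out, }\"$url\""
  done
  printf '[%s]\n' "$out"
}

images_json https://example.com/landscape.jpg https://example.com/sky.jpg
```

The printed array can be pasted into the `"images"` field of the input JSON shown above.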

High Resolution (4K)

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Detailed illustration of a medieval castle",
  "resolution": "4K"
}'

With Google Search Grounding

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Current weather in Tokyo visualized as an artistic scene",
  "enable_google_search": true
}'

Input Options

| Parameter | Type | Description |
|---|---|---|
| prompt | string | Required. What to generate or change |
| images | array | Input images for editing (up to 14) |
| num_images | integer | Number of images to generate |
| aspect_ratio | string | Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto" |
| resolution | string | "1K", "2K", "4K" (Gemini 3 Pro only) |
| output_format | string | Output format for images |
| enable_google_search | boolean | Enable real-time info grounding |
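
The constraints listed above (a fixed set of aspect ratios, and `resolution` being available on Gemini 3 Pro only) can be checked locally before a run is submitted. This is a rough sketch under the assumption that the values listed here are exhaustive; `valid_aspect_ratio` and `resolution_supported` are hypothetical helper names, not CLI features.

```shell
# Hypothetical check: is the aspect ratio one of the documented values?
valid_aspect_ratio() {
  case "$1" in
    1:1|16:9|9:16|4:3|3:4|auto) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical check: $1 = app ID, $2 = resolution.
# Succeeds only for a documented resolution on the Gemini 3 Pro app.
resolution_supported() {
  case "$2" in
    1K|2K|4K) ;;
    *) return 1 ;;
  esac
  [ "$1" = "google/gemini-3-pro-image-preview" ]
}

if resolution_supported google/gemini-2-5-flash-image 4K; then
  echo "resolution accepted"
else
  echo "resolution is documented for Gemini 3 Pro only"
fi
```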

Prompt Tips

Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render

Composition: close-up, wide shot, aerial view, macro, portrait, landscape

Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon

Details: add specific details about textures, colors, mood, atmosphere
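
One way to apply these tips consistently is to assemble the prompt from separate parts. The sketch below is illustrative only: `build_prompt` is a hypothetical helper that joins a subject with optional style, composition, and lighting terms, skipping any that are empty.

```shell
# Hypothetical helper: $1 = subject, remaining arguments = optional modifiers
# (style, composition, lighting, details). Empty modifiers are skipped.
build_prompt() {
  prompt=$1
  shift
  for part in "$@"; do
    if [ -n "$part" ]; then
      prompt="$prompt, $part"
    fi
  done
  printf '%s\n' "$prompt"
}

build_prompt "medieval castle" "watercolor" "wide shot" "golden hour"
```

The resulting string can be used as the `"prompt"` value in any of the examples above.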

Sample Workflow

# 1. Generate sample input to see all options
belt app sample google/gemini-3-pro-image-preview --save input.json

# 2. Edit the prompt

# 3. Run
belt app run google/gemini-3-pro-image-preview --input input.json
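
For scripted runs, step 2 can be replaced by writing the input file directly. The sketch below assumes the file passed to `--input` holds the same JSON object used inline in the examples; the layout that `belt app sample` actually produces is not shown here, so treat that as an assumption. `write_input` is a hypothetical helper and does no escaping, so the prompt must not contain double quotes or backslashes.

```shell
# Hypothetical helper: $1 = output path, $2 = prompt, $3 = aspect ratio.
# Assumption: values contain no double quotes or backslashes.
write_input() {
  cat > "$1" <<EOF
{
  "prompt": "$2",
  "aspect_ratio": "$3"
}
EOF
}

input_file=$(mktemp)
write_input "$input_file" "Panoramic mountain landscape with northern lights" "16:9"
cat "$input_file"

# Possible next step (not executed here):
# belt app run google/gemini-3-pro-image-preview --input "$input_file"
```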

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# All image generation models
npx skills add inference-sh/skills@ai-image-generation

# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation

Browse all image apps: belt app store --category image
