SKILL.md

$27

Step 1: Determine the Image Requirements

Before generating, clarify only what is necessary:

What to illustrate — the concept, architecture, flow, or scene

Language — default to English for both prompt and text in image. Only use another language if the user explicitly requests it

Save location — see "Output Path" below

Style/color preferences — if user has specific needs, use them; otherwise use the default style

Step 2: Select the Provider Path

Path A: Codex Built-In

Use this path when:

The current agent is Codex

The built-in image_gen tool is available

The user did not explicitly request API/CLI execution

Read references/codex-built-in.md, generate with the built-in tool, then move/copy the final image into the workspace if it is project-bound.

Path B: Script Auto Fallback

Use this path when:

The current agent is not Codex

The built-in tool is unavailable

The user explicitly asks for API/CLI execution

Run:

python <skill-root>/scripts/generate_image.py \

  --prompt "your prompt here" \

  --output "/path/to/save/image.png"

The script uses --provider auto by default:

Try OpenAI API when OPENAI_API_KEY is set

If OpenAI API fails or is not configured, try Gemini when GEMINI_API_KEY is set

If neither credential is available, report the missing environment variables

Step 3: Craft the Prompt

Default Style Prefix

The script automatically prepends this style prefix unless --style-prefix or --no-style is used:

Use a clean, modern color palette with soft tones. Minimalist flat illustration style with clear visual hierarchy. Professional and polished look suitable for technical blog articles. No photorealistic rendering. No excessive gradients or shadows.

For the Codex built-in path, include the same style guidance directly in the prompt unless the user requested a different style.

Prompt Writing Guidelines

Be specific about visual elements, relationships, and layout

For technical concepts: describe the components and how they connect

For architecture diagrams: list the layers/components and data flow direction

For flow diagrams: describe the steps and direction of flow

If text labels are needed in the image, spell them out explicitly and keep text short

Default language is English; use another language only when requested

Example Prompts

Architecture diagram:

A system architecture diagram showing: User sends query to an API Gateway,

which routes to a Vector Database labeled "Milvus" and a generation service.

The Vector Database returns relevant documents, which are combined with the

original query and sent to the generation service for final response generation.

Arrows show data flow direction. Each component is a rounded rectangle with

an icon and label.

Concept illustration:

A visual comparison of keyword search vs semantic search. Left side shows

keyword search with exact word matching and highlighted matching words.

Right side shows semantic search with a brain icon understanding meaning

and connecting related concepts with dotted lines. A dividing line separates

the two approaches.

Step 4: Parameters

Default Parameters

Parameter

Default

Notes

Provider

auto in script; Codex built-in when available

Codex built-in first, then OpenAI API, then Gemini

OpenAI model

gpt-image-2

Used by script fallback

Gemini model

gemini-3.1-flash-image-preview

Used by script fallback

Aspect ratio

3:2

Landscape, ideal for article illustrations

Image size

1K

Good balance of quality and cost

Style

Minimal, clean, soft tones

Auto-prepended by script

Language

English

Prompt and in-image text

Script Options

--provider          auto, openai, gemini

--model             Provider model ID for the selected provider

--openai-model      OpenAI model ID, default gpt-image-2

--gemini-model      Gemini model ID, default gemini-3.1-flash-image-preview

--aspect-ratio      1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 9:16, 16:9, 21:9, etc.

--image-size        512, 1K, 2K, 4K

--openai-quality    low, medium, high, auto

--style-prefix      Custom style prefix

--no-style          Skip default style prefix

When to Change Defaults

Scenario

Change

Higher quality final asset

--image-size 2K or --openai-quality high

Social media banner

--aspect-ratio 16:9

Portrait/vertical image

--aspect-ratio 3:4 or --aspect-ratio 9:16

Square image

--aspect-ratio 1:1

User has their own style

--style-prefix "your style" or --no-style

Non-English content

Write prompt in target language

Step 5: Determine Output Path

Follow this priority order:

Priority 1: Context from Current Conversation

If the user is working on a specific markdown file or article:

Check where existing images in that article are stored by looking for image references in the .md file

Save the new image in the same directory as the existing images

Use a descriptive filename that matches the existing naming convention

Example: if the article has ![](images/architecture-overview.png), save to the same images/ directory.

Priority 2: Project Image Directory

If no specific article context but working within a project:

Look for existing image directories: images/, assets/, static/, img/, figures/

Save in the most appropriate existing directory

If none exists, create an images/ directory at the project root or under the relevant content directory

Priority 3: Fallback

If no clear project context:

Save to the current working directory

Use a descriptive filename: concept-name-illustration.png

Step 6: Verify the Result

After generating:

Read the image file to visually verify it matches the user's request

If the result is not satisfactory, refine the prompt and regenerate once with targeted changes

If the image will be inserted into a markdown file, suggest the markdown syntax: ![alt text](relative/path/to/image.png)

Report which provider path was used and where the final file was saved

image-generation

SKILL.md

Step 1: Determine the Image Requirements

Step 2: Select the Provider Path

Path A: Codex Built-In

Path B: Script Auto Fallback

Step 3: Craft the Prompt

Default Style Prefix

Prompt Writing Guidelines

Example Prompts

Step 4: Parameters

Default Parameters

Script Options

When to Change Defaults

Step 5: Determine Output Path

Priority 1: Context from Current Conversation

Priority 2: Project Image Directory

Priority 3: Fallback

Step 6: Verify the Result

Stop writing automation&scrapers

image-generation

SKILL.md

Step 1: Determine the Image Requirements

Step 2: Select the Provider Path

Path A: Codex Built-In

Path B: Script Auto Fallback

Step 3: Craft the Prompt

Default Style Prefix

Prompt Writing Guidelines

Example Prompts

Step 4: Parameters

Default Parameters

Script Options

When to Change Defaults

Step 5: Determine Output Path

Priority 1: Context from Current Conversation

Priority 2: Project Image Directory

Priority 3: Fallback

Step 6: Verify the Result

Let your agent run on any real-world website

Related skills

Stop writing automation&scrapers