🎯

baoyu-image-gen

🎯Skill

from yelban/baoyu-skills.tw

What it does

Generates AI images using OpenAI, Google, and DashScope APIs with flexible text-to-image generation options.

📦

Part of

yelban/baoyu-skills.tw(16 items)

baoyu-image-gen

Installation

npxRun with npx

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png

npxRun with npx

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9

npxRun with npx

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k

npxRun with npx

npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png

npxRun with npx

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png

+ 2 more commands

📖 Extracted from docs: yelban/baoyu-skills.tw

Need more details? View full documentation on GitHub →

2Installs

AddedFeb 4, 2026

View on GitHub Back to Skills

Skill Details

SKILL.md

AI image generation with OpenAI, Google and DashScope APIs. Supports text-to-image, reference images, aspect ratios. Sequential by default; parallel generation available on request. Use when user asks to generate, create, or draw images.

Overview

# Image Generation (AI SDK)

Official API-based image generation. Supports OpenAI, Google and DashScope (阿里通義永珍) providers.

Script Directory

Agent Execution:

SKILL_DIR = this SKILL.md file's directory
Script path = ${SKILL_DIR}/scripts/main.ts

Preferences (EXTEND.md)

Use Bash to check EXTEND.md existence (priority order):

```bash

# Check project-level first

test -f .baoyu-skills/baoyu-image-gen/EXTEND.md && echo "project"

# Then user-level (cross-platform: $HOME works on macOS/Linux/WSL)

test -f "$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md" && echo "user"

```

┌──────────────────────────────────────────────────┬───────────────────┐

│ Path │ Location │

├──────────────────────────────────────────────────┼───────────────────┤

│ .baoyu-skills/baoyu-image-gen/EXTEND.md │ Project directory │

│ $HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md │ User home │

└──────────────────────────────────────────────────┴───────────────────┘

┌───────────┬───────────────────────────────────────────────────────────────────────────┐

│ Result │ Action │

├───────────┼───────────────────────────────────────────────────────────────────────────┤

│ Found │ Read, parse, apply settings │

│ Not found │ Use defaults │

└───────────┴───────────────────────────────────────────────────────────────────────────┘

EXTEND.md Supports: Default provider | Default quality | Default aspect ratio

Usage

```bash

# Basic

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png

# With aspect ratio

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9

# High quality

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k

# From prompt files

npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png

# With reference images (Google multimodal only)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png

# Specific provider

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider openai

# DashScope (阿里通義永珍)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "一隻可愛的貓" --image out.png --provider dashscope

```

Options

| Option | Description |

|--------|-------------|

| --prompt , -p | Prompt text |

| --promptfiles | Read prompt from files (concatenated) |

| --image | Output image path (required) |

| --model , -m | Model ID |

| --ar | Aspect ratio (e.g., 16:9, 1:1, 4:3) |

| --size | Size (e.g., 1024x1024) |

| --quality normal\|2k | Quality preset (default: 2k) |

| --imageSize 1K\|2K\|4K | Image size for Google (default: from quality) |

| --ref | Reference images (Google multimodal only) |

| --n | Number of images |

| --json | JSON output |

Environment Variables

| Variable | Description |

|----------|-------------|

| OPENAI_API_KEY | OpenAI API key |

| GOOGLE_API_KEY | Google API key |

| DASHSCOPE_API_KEY | DashScope API key (阿里雲) |

| OPENAI_IMAGE_MODEL | OpenAI model override |

| GOOGLE_IMAGE_MODEL | Google model override |

| DASHSCOPE_IMAGE_MODEL | DashScope model override (default: z-image-turbo) |

| OPENAI_BASE_URL | Custom OpenAI endpoint |

| GOOGLE_BASE_URL | Custom Google endpoint |

| DASHSCOPE_BASE_URL | Custom DashScope endpoint |

Load Priority: CLI args > env vars > /.baoyu-skills/.env > ~/.baoyu-skills/.env

Provider Selection

--provider specified → use it
Only one API key available → use that provider
Multiple available → default to Google

Quality Presets

|--------|------------------|-------------|----------|

| normal | 1K | 1024px | Quick previews |

| 2k (default) | 2K | 2048px | Covers, illustrations, infographics |

Google imageSize: Can be overridden with --imageSize 1K|2K|4K

Aspect Ratios

Supported: 1:1, 16:9, 9:16, 4:3, 3:4, 2.35:1

Google multimodal: uses imageConfig.aspectRatio
Google Imagen: uses aspectRatio parameter
OpenAI: maps to closest supported size

Generation Mode

Default: Sequential generation (one image at a time). This ensures stable output and easier debugging.

Parallel Generation: Only use when user explicitly requests parallel/concurrent generation.

| Mode | When to Use |

|------|-------------|

| Sequential (default) | Normal usage, single images, small batches |

| Parallel | User explicitly requests, large batches (10+) |

Parallel Settings (when requested):

| Setting | Value |

|---------|-------|

| Recommended concurrency | 4 subagents |

| Max concurrency | 8 subagents |

| Use case | Large batch generation when user requests parallel |

Agent Implementation (parallel mode only):

```

# Launch multiple generations in parallel using Task tool

# Each Task runs as background subagent with run_in_background=true

# Collect results via TaskOutput when all complete

```

Error Handling

Missing API key → error with setup instructions
Generation failure → auto-retry once
Invalid aspect ratio → warning, proceed with default
Reference images with non-multimodal model → warning, ignore refs

Extension Support

Custom configurations via EXTEND.md. See Preferences section for paths and supported options.

More from this repository10

🎯

baoyu-cover-image🎯Skill

Generates customizable article cover images with 5-dimensional design options, supporting multiple styles, palettes, and aspect ratios.

🎯

baoyu-article-illustrator🎯Skill

Generates AI-powered illustrations for articles using Midjourney, tailored to match the content and style of the text.

🎯

baoyu-xhs-images🎯Skill

Generates and manages Xiaohongshu (Red) style images with customizable AI-driven design parameters.

🎯

baoyu-slide-deck🎯Skill

Generates professional slide deck images from content, creating stylized presentations with customizable options for audience, style, and language.

🎯

baoyu-post-to-x🎯Skill

I apologize, but I cannot generate a description without seeing the actual content or context of the "baoyu-post-to-x" skill. Could you provide more details about what this specific skill does? Wit...

🎯

release-skills🎯Skill

Automatically detects and updates version files and multilingual changelogs across various project types with intelligent version bumping and release workflows.

🎯

baoyu-post-to-wechat🎯Skill

Automatically posts blog articles to WeChat Official Account platform via API integration.

🎯

baoyu-danger-gemini-web🎯Skill

Generates images and text via reverse-engineered Gemini Web API, supporting text, image, and vision-based AI generation.

🎯

baoyu-danger-x-to-markdown🎯Skill

Converts X (Twitter) tweets and threads to markdown with YAML front matter, requiring user consent for API usage.

🎯

baoyu-comic🎯Skill

I apologize, but I cannot generate a description without seeing the actual content or context of the "baoyu-comic" skill. Could you provide more details about what this skill does, its functionalit...