🎯

baoyu-image-gen

🎯Skill

from yelban/baoyu-skills.tw

VibeIndex|
What it does

Generates AI images using OpenAI, Google, and DashScope APIs with flexible text-to-image generation options.

πŸ“¦

Part of

yelban/baoyu-skills.tw(16 items)

baoyu-image-gen

Installation

npxRun with npx
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png
npxRun with npx
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9
npxRun with npx
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k
npxRun with npx
npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png
npxRun with npx
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png

+ 2 more commands

πŸ“– Extracted from docs: yelban/baoyu-skills.tw
2Installs
-
AddedFeb 4, 2026

Skill Details

SKILL.md

AI image generation with OpenAI, Google and DashScope APIs. Supports text-to-image, reference images, aspect ratios. Sequential by default; parallel generation available on request. Use when user asks to generate, create, or draw images.

Overview

# Image Generation (AI SDK)

Official API-based image generation. Supports OpenAI, Google and DashScope (ι˜Ώι‡Œι€šηΎ©ζ°Έη) providers.

Script Directory

Agent Execution:

  1. SKILL_DIR = this SKILL.md file's directory
  2. Script path = ${SKILL_DIR}/scripts/main.ts

Preferences (EXTEND.md)

Use Bash to check EXTEND.md existence (priority order):

```bash

# Check project-level first

test -f .baoyu-skills/baoyu-image-gen/EXTEND.md && echo "project"

# Then user-level (cross-platform: $HOME works on macOS/Linux/WSL)

test -f "$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md" && echo "user"

```

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”

β”‚ Path β”‚ Location β”‚

β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€

β”‚ .baoyu-skills/baoyu-image-gen/EXTEND.md β”‚ Project directory β”‚

β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€

β”‚ $HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md β”‚ User home β”‚

β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”

β”‚ Result β”‚ Action β”‚

β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€

β”‚ Found β”‚ Read, parse, apply settings β”‚

β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€

β”‚ Not found β”‚ Use defaults β”‚

β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

EXTEND.md Supports: Default provider | Default quality | Default aspect ratio

Usage

```bash

# Basic

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png

# With aspect ratio

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9

# High quality

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k

# From prompt files

npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png

# With reference images (Google multimodal only)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png

# Specific provider

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider openai

# DashScope (ι˜Ώι‡Œι€šηΎ©ζ°Έη)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "δΈ€ιš»ε―ζ„›ηš„θ²“" --image out.png --provider dashscope

```

Options

| Option | Description |

|--------|-------------|

| --prompt , -p | Prompt text |

| --promptfiles | Read prompt from files (concatenated) |

| --image | Output image path (required) |

| --provider google\|openai\|dashscope | Force provider (default: google) |

| --model , -m | Model ID |

| --ar | Aspect ratio (e.g., 16:9, 1:1, 4:3) |

| --size | Size (e.g., 1024x1024) |

| --quality normal\|2k | Quality preset (default: 2k) |

| --imageSize 1K\|2K\|4K | Image size for Google (default: from quality) |

| --ref | Reference images (Google multimodal only) |

| --n | Number of images |

| --json | JSON output |

Environment Variables

| Variable | Description |

|----------|-------------|

| OPENAI_API_KEY | OpenAI API key |

| GOOGLE_API_KEY | Google API key |

| DASHSCOPE_API_KEY | DashScope API key (ι˜Ώι‡Œι›²) |

| OPENAI_IMAGE_MODEL | OpenAI model override |

| GOOGLE_IMAGE_MODEL | Google model override |

| DASHSCOPE_IMAGE_MODEL | DashScope model override (default: z-image-turbo) |

| OPENAI_BASE_URL | Custom OpenAI endpoint |

| GOOGLE_BASE_URL | Custom Google endpoint |

| DASHSCOPE_BASE_URL | Custom DashScope endpoint |

Load Priority: CLI args > env vars > /.baoyu-skills/.env > ~/.baoyu-skills/.env

Provider Selection

  1. --provider specified β†’ use it
  2. Only one API key available β†’ use that provider
  3. Multiple available β†’ default to Google

Quality Presets

| Preset | Google imageSize | OpenAI Size | Use Case |

|--------|------------------|-------------|----------|

| normal | 1K | 1024px | Quick previews |

| 2k (default) | 2K | 2048px | Covers, illustrations, infographics |

Google imageSize: Can be overridden with --imageSize 1K|2K|4K

Aspect Ratios

Supported: 1:1, 16:9, 9:16, 4:3, 3:4, 2.35:1

  • Google multimodal: uses imageConfig.aspectRatio
  • Google Imagen: uses aspectRatio parameter
  • OpenAI: maps to closest supported size

Generation Mode

Default: Sequential generation (one image at a time). This ensures stable output and easier debugging.

Parallel Generation: Only use when user explicitly requests parallel/concurrent generation.

| Mode | When to Use |

|------|-------------|

| Sequential (default) | Normal usage, single images, small batches |

| Parallel | User explicitly requests, large batches (10+) |

Parallel Settings (when requested):

| Setting | Value |

|---------|-------|

| Recommended concurrency | 4 subagents |

| Max concurrency | 8 subagents |

| Use case | Large batch generation when user requests parallel |

Agent Implementation (parallel mode only):

```

# Launch multiple generations in parallel using Task tool

# Each Task runs as background subagent with run_in_background=true

# Collect results via TaskOutput when all complete

```

Error Handling

  • Missing API key β†’ error with setup instructions
  • Generation failure β†’ auto-retry once
  • Invalid aspect ratio β†’ warning, proceed with default
  • Reference images with non-multimodal model β†’ warning, ignore refs

Extension Support

Custom configurations via EXTEND.md. See Preferences section for paths and supported options.

More from this repository10

🎯
baoyu-cover-image🎯Skill

Generates customizable article cover images with 5-dimensional design options, supporting multiple styles, palettes, and aspect ratios.

🎯
baoyu-article-illustrator🎯Skill

Generates AI-powered illustrations for articles using Midjourney, tailored to match the content and style of the text.

🎯
baoyu-xhs-images🎯Skill

Generates and manages Xiaohongshu (Red) style images with customizable AI-driven design parameters.

🎯
baoyu-slide-deck🎯Skill

Generates professional slide deck images from content, creating stylized presentations with customizable options for audience, style, and language.

🎯
baoyu-post-to-x🎯Skill

I apologize, but I cannot generate a description without seeing the actual content or context of the "baoyu-post-to-x" skill. Could you provide more details about what this specific skill does? Wit...

🎯
release-skills🎯Skill

Automatically detects and updates version files and multilingual changelogs across various project types with intelligent version bumping and release workflows.

🎯
baoyu-post-to-wechat🎯Skill

Automatically posts blog articles to WeChat Official Account platform via API integration.

🎯
baoyu-danger-gemini-web🎯Skill

Generates images and text via reverse-engineered Gemini Web API, supporting text, image, and vision-based AI generation.

🎯
baoyu-danger-x-to-markdown🎯Skill

Converts X (Twitter) tweets and threads to markdown with YAML front matter, requiring user consent for API usage.

🎯
baoyu-comic🎯Skill

I apologize, but I cannot generate a description without seeing the actual content or context of the "baoyu-comic" skill. Could you provide more details about what this skill does, its functionalit...