🎯

baoyu-image-gen

🎯Skill

from jimliu/baoyu-skills

What it does

AI SDK-based image generation skill supporting OpenAI, Google, and DashScope providers with text-to-image generation, reference images, aspect ratio control, and quality presets through a unified slash command interface.

Overview

Baoyu Image Gen is an AI SDK-based image generation skill for Claude Code that supports multiple providers including OpenAI, Google, and DashScope (Aliyun Tongyi Wanxiang). It enables text-to-image generation with features like reference images, aspect ratio control, and quality presets, all accessible through a simple slash command interface. Part of the jimliu/baoyu-skills collection, it provides a unified interface across different AI image generation backends.

Key Features

Multi-Provider Support: Works with OpenAI, Google, and DashScope (Aliyun Tongyi Wanxiang) APIs with automatic provider detection based on available API keys
Flexible Image Options: Supports custom aspect ratios (e.g., 16:9, 1:1, 4:3), quality presets (normal and 2k), and specific output sizes
Reference Image Support: Google multimodal mode allows using reference images to guide generation (e.g., "Make it blue" with a source image)
Prompt File Input: Can read prompts from files using --promptfiles, enabling complex multi-file prompt compositions
Customizable Endpoints: Supports custom base URLs for each provider, allowing use with compatible API proxies

Who is this for?

Developers who want to generate images directly from Claude Code without switching to external tools or web interfaces
Content creators and designers who need quick image generation with fine-grained control over providers, aspect ratios, and quality levels
Teams working with Chinese AI platforms (DashScope/Tongyi Wanxiang) who need a unified interface alongside OpenAI and Google providers

📦

Same repository

jimliu/baoyu-skills(29 items)

baoyu-image-gen

Installation

Vibe Index InstallInstalls to .claude/skills/

npx vibeindex add jimliu/baoyu-skills --skill baoyu-image-gen

skills.sh Install⚠ Installs to .agents/skills/

npx skills add jimliu/baoyu-skills --skill baoyu-image-gen

Manual InstallCopy SKILL.md content and save to the path below

~/.claude/skills/baoyu-image-gen/SKILL.md

SKILL.md

26,251Installs

10,234

Last UpdatedMar 22, 2026

View on GitHub Back to Skills

More from this repository10

🎯

baoyu-post-to-wechat🎯Skill

A Claude Code skill for publishing content to WeChat Official Accounts, supporting article posting via API/browser and image-text posting with up to 9 images, with configurable preferences and a structured 7-step publishing workflow.

🎯

baoyu-markdown-to-html🎯Skill

Converts Markdown files to clean, semantic HTML with support for custom styles, code highlighting, and responsive rendering.

🎯

baoyu-cover-image🎯Skill

A Claude Code skill that generates article cover images using a 5-dimensional system (Type, Palette, Rendering, Text, Mood) with 9 color palettes and 6 rendering styles for 54 unique combinations.

🎯

baoyu-infographic🎯Skill

Generates professional infographics by combining 20 layout types (bento-grid, funnel, timeline, etc.) with 17 visual styles (craft-handmade, cyberpunk, pixel-art, etc.), with auto-recommended combinations based on content analysis.

🎯

baoyu-slide-deck🎯Skill

A Claude Code skill that transforms written content into professional slide deck images with 16 visual style presets, smart content scaling, and multi-language support.

🎯

baoyu-article-illustrator🎯Skill

A Claude Code skill that analyzes articles, identifies optimal illustration positions, and generates images using a Type x Style two-dimensional system with 6 types and multiple visual styles.

🎯

baoyu-xhs-images🎯Skill

A Claude Code skill that generates Xiaohongshu (RedNote) infographic card series from content using a Style x Layout 2D system with 9 visual styles and 6 layout types.

🎯

baoyu-url-to-markdown🎯Skill

Fetches any URL via Chrome CDP with full JavaScript rendering and converts it to clean markdown with metadata. Supports auto-capture and wait mode for login-required or dynamically loaded pages.

🎯

baoyu-comic🎯Skill

A Claude Code skill for creating educational and narrative comics with 5 art styles, 7 tones, 3 presets, and 6 layout options, generating detailed panel layouts and sequential images from source material or text input.

🎯

baoyu-post-to-x🎯Skill

Agent skill for posting text, images, and long-form Markdown articles to X (Twitter) directly from Claude Code using Chrome CDP, part of Baoyu's cross-platform content publishing suite.