π―
image-generation
π―Skillfrom xiangyu-cas/vision-skills
What it does
Generates, edits, and composes images using Google's Gemini AI with text prompts, multi-reference inputs, and search grounding.
Part of
xiangyu-cas/vision-skills(3 items)
image-generation
Installation
git cloneClone repository
git clone https://github.com/Xiangyu-CAS/Vision-Skills.gitπ Extracted from docs: xiangyu-cas/vision-skills
3Installs
-
AddedFeb 4, 2026
Skill Details
SKILL.md
Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK.
More from this repository2
π―π―
video-generationπ―Skill
Generates high-quality videos from text or images using Gemini's Veo 3.1, with customizable parameters like resolution, duration, and interpolation.
bbdown-cliπ―Skill
Downloads Bilibili videos via CLI, supporting 720p preference, authentication methods, and flexible output configuration.