gpt-image-edit
Skill from agentspace-so/runcomfy-agent-skills
Edit images with OpenAI GPT Image 2 on RunComfy, excelling at multilingual in-image text editing across any script (Latin, kana, CJK, Cyrillic, Arabic) and multi-reference composition with up to 10 input images. Ideal for identity-preserving edits and layout-precise repositioning.
Overview
A RunComfy skill for editing images with OpenAI GPT Image 2's edit endpoint (ChatGPT Images 2.0 image-to-image). It excels at multilingual in-image text editing across all writing systems (Latin, kana, CJK, Cyrillic, Arabic) and supports up to 10 reference images per call for multi-reference composition. The skill is strongest in class at preserving identity through targeted edits and rewriting embedded text, making it the go-to choice when typography precision and multilingual support matter.
Key Features
- Best-in-class multilingual in-image text editing across Latin, kana, CJK, Cyrillic, and Arabic scripts
- Identity preservation through targeted edits with "keep X unchanged" prompt patterns
- Up to 10 reference images per call: first image is primary, rest are auxiliary for composition cues
- Layout-precise editing: move headlines, swap CTAs, and rearrange visual elements with spatial accuracy
- Built-in routing guidance for when to use Nano Banana Edit, Flux Kontext, or GPT Image 2 text-to-image instead
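The reference-image and prompt conventions above can be sketched as a small request builder. This is a minimal illustration, not the skill's actual interface: the function names and the `prompt` / `images` payload keys are hypothetical. Only the 10-image limit, the primary-first ordering, and the "keep X unchanged" pattern come from the feature list.

```python
from typing import Dict, List, Sequence

# Limit stated in the feature list: up to 10 reference images per call.
MAX_REFERENCE_IMAGES = 10


def build_edit_prompt(change: str, keep_unchanged: Sequence[str] = ()) -> str:
    """Append a 'keep X unchanged' clause to the requested change."""
    prompt = change.strip()
    if keep_unchanged:
        prompt += " Keep " + ", ".join(keep_unchanged) + " unchanged."
    return prompt


def build_edit_request(
    primary_image: str,
    change: str,
    keep_unchanged: Sequence[str] = (),
    auxiliary_images: Sequence[str] = (),
) -> Dict[str, object]:
    """Assemble a payload with the primary image first, auxiliaries after.

    The 'prompt' and 'images' keys are assumptions for illustration only.
    """
    images: List[str] = [primary_image, *auxiliary_images]
    if len(images) > MAX_REFERENCE_IMAGES:
        raise ValueError(
            f"got {len(images)} images; at most {MAX_REFERENCE_IMAGES} are allowed"
        )
    return {
        "prompt": build_edit_prompt(change, keep_unchanged),
        "images": images,
    }


request = build_edit_request(
    "poster.png",
    "Replace the headline with 'Hola'.",
    keep_unchanged=["the logo", "the background"],
    auxiliary_images=["font_ref.png"],
)
print(request["prompt"])
```

Validating the image count before sending anything keeps an oversized reference set from failing later in the call, and placing the primary image at index 0 mirrors the ordering the feature list describes.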
Who is this for?
- Global marketing teams that need to localize ad creatives by rewriting in-image text across multiple languages and writing systems
- Brand designers creating translated headline variants while maintaining exact visual identity and layout
- E-commerce teams updating product labels, signage, and embedded text across diverse markets
Same repository
agentspace-so/runcomfy-agent-skills (30 items)
Installation
npx vibeindex add agentspace-so/runcomfy-agent-skills --skill gpt-image-edit
npx skills add agentspace-so/runcomfy-agent-skills --skill gpt-image-edit
~/.claude/skills/gpt-image-edit/SKILL.md
More from this repository (10)
A smart intent-routing skill for video editing on RunComfy that selects the best model based on the user's intent. Routes to Wan 2.7 Edit-Video for restyle and background swaps, Kling 2.6 Pro for precise motion transfer, or Lucy Edit for lightweight identity-stable restyle and outfit swaps.
A smart intent-routing skill for image-to-video generation on RunComfy that automatically selects the best model for the task. Routes to HappyHorse 1.0 I2V for general animations, Wan 2.7 for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal composition from image, video, and audio references.
A smart intent-routing skill for image editing on RunComfy that selects the best model based on the editing task. Routes to Nano Banana Edit for batch edits up to 20 images, GPT Image 2 for multilingual text rewrite, Flux Kontext Pro for single-shot precise edits, or Z-Image Turbo for mask-driven inpainting.
Edit images with Black Forest Labs' Flux 1 Kontext Pro on RunComfy, specializing in single-reference precise local edits with high-fidelity source preservation. Ideal for targeted changes like adding objects or modifying details while keeping the rest of the image unchanged.
A RunComfy skill that generates images using Google Nano Banana 2, the flash-tier text-to-image model in the Gemini family. Optimized for rapid iteration, social thumbnails, and in-image typography with configurable resolution tiers and safety tolerance.
Provides Kling 3.0 video generation on RunComfy, covering all six endpoints across three quality tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video) for Kuaishou's third-generation cinematic video model with native synchronized audio.
Generates custom Codex Pets for OpenAI Codex using RunComfy's GPT Image 2 edit endpoint. Turns a single reference image into a Codex-compatible spritesheet (1536x1872, 9 animation states) and pet.json manifest with just one API call, requiring only a RUNCOMFY_TOKEN (no Codex Pro or OPENAI_API_KEY needed).
A Claude Code skill for generating AI videos through RunComfy CLI, supporting models like HappyHorse, Wan 2.7, Seedance, Kling, and Veo for text-to-video, image-to-video, and video-extend with automatic model selection.
A Claude Code skill that generates and edits images through RunComfy CLI, supporting 11+ AI models including FLUX 2, GPT Image 2, and Google Nano Banana with automatic model selection for text-to-image and image-to-image workflows.
The foundation skill for the RunComfy platform, providing a single CLI to install, authenticate, and invoke hundreds of model endpoints including image generation, video, face-swap, lip-sync, and LoRA training.