runcomfy-cli
Skill from agentspace-so/runcomfy-agent-skills
The foundation skill for the RunComfy platform, providing a single CLI to install, authenticate, and invoke hundreds of model endpoints including image generation, video, face-swap, lip-sync, and LoRA training.
Overview
RunComfy CLI is the foundation command-line tool for the RunComfy platform. It provides a single binary (runcomfy) that handles installation, authentication, model discovery, and invocation of hundreds of AI model endpoints. After installing via npm and signing in once with runcomfy login, you can call any text-to-image, video generation, image editing, lip-sync, face-swap, LoRA training, or other model endpoint using runcomfy run <model_id> --input '{...}'. The CLI supports polling for async results, streaming output, JSON output mode for scripting, and automatic error handling.
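A scripted call could assemble the runcomfy run invocation described above before handing it to a subprocess. The following is a minimal sketch, assuming Python as the wrapper language. The model ID example/text-to-image and the prompt input field are hypothetical placeholders, and placing --json at the end of the command is an assumption. Only the run subcommand, --input, and --json come from the overview. The helper builds the argument list and prints it; it does not run the CLI.

```python
import json
import shlex


def build_run_command(model_id, inputs, json_output=False):
    """Build the argv list for: runcomfy run <model_id> --input '{...}'.

    Nothing is executed here; pass the result to subprocess.run() to call the CLI.
    """
    if not model_id:
        raise ValueError("model_id must be a non-empty string")
    # Serialize the input payload as a single JSON argument.
    argv = ["runcomfy", "run", model_id, "--input", json.dumps(inputs)]
    if json_output:
        # --json is the scripting-friendly output mode named in the overview;
        # its position at the end of the command is an assumption.
        argv.append("--json")
    return argv


# Hypothetical model ID and input field, for illustration only.
command = build_run_command(
    "example/text-to-image",
    {"prompt": "a lighthouse at dusk"},
    json_output=True,
)
# Print a shell-quoted form that could be pasted into a terminal.
print(shlex.join(command))
```

Passing the arguments as a list rather than a single string keeps the JSON payload intact without manual shell quoting. The real input fields for a given model would come from runcomfy schema <model_id>.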
Key Features
- Single auth, hundreds of models - Install once with npm i -g @runcomfy/cli, sign in once with runcomfy login, and access every model endpoint on the RunComfy platform, including image generation, video generation, editing, lip-sync, face-swap, ControlNet, relighting, upscaling, and LoRA training.
- Flexible execution modes - Run jobs with automatic polling (--poll), streaming output, fire-and-forget (--no-wait), or JSON output mode (--json) for CI/CD and scripting pipelines.
- Model discovery and schema inspection - Browse available models with runcomfy models, inspect input schemas with runcomfy schema <model_id>, and discover endpoints without leaving the terminal.
- CI/container-friendly auth - Use the RUNCOMFY_TOKEN environment variable for headless environments, or interactive browser-based login for development machines. Tokens are stored securely at ~/.config/runcomfy/token.json with mode 0600.
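Before launching a job from a script, it can help to check which of the two credential sources in the last feature is present. The sketch below is a hedged illustration, not the CLI's own logic: the helper names auth_source and is_owner_only are hypothetical, and checking the environment variable before the token file is an assumption about precedence. The variable name, file path, and 0600 mode are taken from the list above. The token file's contents are not read, since its format is not described here.

```python
import os
import stat
from pathlib import Path

# Path and mode as stated in the feature list above.
TOKEN_FILE = Path.home() / ".config" / "runcomfy" / "token.json"


def auth_source(env=None, token_file=TOKEN_FILE):
    """Return "env", "file", or None depending on which credential is present.

    Checking the environment first is an assumption, not documented behavior.
    """
    env = os.environ if env is None else env
    if env.get("RUNCOMFY_TOKEN"):
        return "env"
    if Path(token_file).is_file():
        return "file"
    return None


def is_owner_only(path):
    """Return True if the file's permission bits are exactly 0600."""
    return stat.S_IMODE(Path(path).stat().st_mode) == 0o600


if __name__ == "__main__":
    source = auth_source()
    print("credential source:", source)
    if source == "file":
        print("token file is mode 0600:", is_owner_only(TOKEN_FILE))
```

In a CI job, a result of None from auth_source would be a reasonable point to fail early with a message asking for RUNCOMFY_TOKEN to be set.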
Who is this for?
- Developers building AI-powered image and video pipelines who want CLI-level control over model invocation without managing individual provider APIs
- Content creators and automation engineers who need scriptable access to hundreds of generative AI models through a single authenticated interface
- Teams integrating generative AI into CI/CD workflows, batch processing scripts, or agent-driven automation systems
Same repository
agentspace-so/runcomfy-agent-skills (30 items)
Installation
npx vibeindex add agentspace-so/runcomfy-agent-skills --skill runcomfy-cli
npx skills add agentspace-so/runcomfy-agent-skills --skill runcomfy-cli
~/.claude/skills/runcomfy-cli/SKILL.md
More from this repository (10)
A smart intent-routing skill for video editing on RunComfy that selects the best model based on the user's intent. Routes to Wan 2.7 Edit-Video for restyle and background swaps, Kling 2.6 Pro for precise motion transfer, or Lucy Edit for lightweight identity-stable restyle and outfit swaps.
A smart intent-routing skill for image-to-video generation on RunComfy that automatically selects the best model for the task. Routes to HappyHorse 1.0 I2V for general animations, Wan 2.7 for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal composition from image, video, and audio references.
A smart intent-routing skill for image editing on RunComfy that selects the best model based on the editing task. Routes to Nano Banana Edit for batch edits up to 20 images, GPT Image 2 for multilingual text rewrite, Flux Kontext Pro for single-shot precise edits, or Z-Image Turbo for mask-driven inpainting.
Edit images with Black Forest Labs' Flux 1 Kontext Pro on RunComfy, specializing in single-reference precise local edits with high-fidelity source preservation. Ideal for targeted changes like adding objects or modifying details while keeping the rest of the image unchanged.
A RunComfy skill that generates images using Google Nano Banana 2, the flash-tier text-to-image model in the Gemini family. Optimized for rapid iteration, social thumbnails, and in-image typography with configurable resolution tiers and safety tolerance.
Edit images with Google Nano Banana 2 on RunComfy, supporting batch edits of up to 20 images per call with strong identity preservation. Features localized edits using spatial language, background swaps, and configurable resolution up to 4K.
Generate text-to-video with HappyHorse 1.0 on RunComfy, currently ranked #1 on Artificial Analysis Video Arena. Supports native 1080p with in-pass synchronized audio, multi-shot character consistency, and 6-language prompt support via the RunComfy CLI.
Generate text-to-video with Wan-AI's Wan 2.7 on RunComfy, featuring multi-reference conditioning and audio-driven lip-sync via custom audio tracks. Supports prompt expansion, negative prompts, and up to 1080p resolution through the RunComfy CLI.
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy, supporting multi-modal references including up to 9 images, 3 videos, and 3 audio tracks. Features native lip-synced audio generation and is ideal for brand-consistent multi-language narratives.
Edit images with OpenAI GPT Image 2 on RunComfy, excelling at multilingual in-image text editing across any script (Latin, kana, CJK, Cyrillic, Arabic) and multi-reference composition with up to 10 input images. Ideal for identity-preserving edits and layout-precise repositioning.