🎯

ai-video-generation

🎯Skill

from doany-ai/skills

What it does

Generate AI videos on RunComfy through a smart router across the full video-model catalog including HappyHorse 1.0, Wan 2-7, Seedance v2, Kling 3.0, Veo 3-1, and Hailuo 2-3. Covers text-to-video, image-to-video, and video-extend, automatically selecting the best model for the user's intent.

Overview

AI Video Generation is a Claude Code skill that smart-routes video creation across the full RunComfy video-model catalog, supporting text-to-video, image-to-video, and video-extend workflows. It selects the best model for the user's intent from options including HappyHorse 1.0 (Arena #1 with native in-pass audio), Wan 2-7 (open weights with audio-driven lip-sync), Seedance v2 (multi-modal cinematic with up to 9 reference images), Kling 3.0 (4K multi-shot character identity), Veo 3-1 (physics-respecting motion), and Hailuo 2-3 (natural motion for real-world subjects). Each model ships with documented prompting patterns and exact CLI invocations.

Key Features

Intent-based model routing: Automatically classifies whether the user needs general-purpose video, audio-driven lip-sync, cinematic multi-reference composition, 4K hero shots, physics-accurate motion, or natural real-world animation, and picks the matching model
Native audio generation: HappyHorse 1.0 generates synchronized audio in-pass (describe audio inline with the prompt), while Wan 2-7 supports audio-driven lip-sync from a specific voiceover MP3
Multi-modal conditioning: Seedance v2 Pro accepts up to 9 reference images, 3 reference videos, and 3 reference audio tracks for complex cinematic compositions with lens and film language support
Text-to-video, image-to-video, and video-extend: Full coverage from prompt-only generation through still-image animation to extending existing clips via Veo 3-1's extend endpoint
Quality-cost tiers per model: Multiple tiers available (e.g., Kling 3.0 4K/Pro/Standard) so users can iterate cheaply and upgrade to production quality for final delivery

Who is this for?

This skill is for developers, content creators, and production teams who need to generate AI videos through Claude Code without manually comparing video models. It serves social media creators producing vertical clips with audio, brand teams creating cinematic ad frames, production pipelines needing dialog lip-sync from voiceover files, and anyone who wants physics-accurate product spins or multi-shot character narratives through a single CLI command.

📦

Same repository

doany-ai/skills(29 items)

ai-video-generation

Installation

Vibe Index InstallInstalls to .claude/skills/

npx vibeindex add doany-ai/skills --skill ai-video-generation

skills.sh Install⚠ Installs to .agents/skills/

npx skills add doany-ai/skills --skill ai-video-generation

Manual InstallCopy SKILL.md content and save to the path below

~/.claude/skills/ai-video-generation/SKILL.md

SKILL.md

12Installs

AddedMay 18, 2026

View on GitHub Back to Skills

More from this repository10

🎯

image-outpainting🎯Skill

A Claude Code skill for image outpainting on RunComfy that extends images beyond their original canvas, supporting aspect ratio changes, uncropping, and canvas expansion by routing across Nano Banana 2 Edit, GPT Image 2 Edit, and FLUX Kontext Pro.

🎯

gpt-image-2🎯Skill

A Claude Code skill for generating and editing images with OpenAI GPT Image 2 (ChatGPT Images 2.0) hosted on RunComfy, with strengths in embedded text, logos, multilingual typography, and precise multi-element prompt following.

🎯

image-edit🎯Skill

A Claude Code skill that acts as a smart router for image editing on RunComfy, automatically selecting the best model (Nano Banana Edit, GPT Image 2 Edit, Flux Kontext Pro, or Z-Image Turbo Inpaint) based on the user's intent.

🎯

nano-banana-2🎯Skill

A Claude Code skill for generating images with Google Nano Banana 2, a Gemini-family flash-tier text-to-image model hosted on RunComfy, optimized for rapid iteration, social thumbnails, and in-image typography rendering.

🎯

seedance-v2🎯Skill

A Claude Code skill for generating cinematic short-form video with ByteDance Seedance 2.0 Pro via RunComfy, supporting multi-modal references (images, videos, audio) with native lip-synced audio and cinematic motion refinement.

🎯

nano-banana-edit🎯Skill

Edit images with Google Nano Banana 2 on RunComfy — preserve subject identity, swap backgrounds, localize edits with spatial language, and perform batch edits on up to 20 images in a single call. Best for identity-preserving edits and consistent multi-image processing.

🎯

runcomfy-cli🎯Skill

A unified CLI for the RunComfy Model API that provides one binary and one authentication to access hundreds of model endpoints including image generation, video generation, lip-sync, face swap, inpainting, outpainting, ControlNet, relighting, upscaling, and LoRA training.

🎯

flux-2-klein🎯Skill

Generate images with Flux 2 Klein, Black Forest Labs' distilled fast variant of Flux 2, on RunComfy. Optimized for sub-second latency and rapid creative iteration with multi-reference brand styling and declarative prompts, available in 9B and 4B variants.

🎯

happyhorse-1-0🎯Skill

A Claude Code skill for generating text-to-video with HappyHorse 1.0 on RunComfy, currently ranked #1 on Artificial Analysis Video Arena, featuring native 1080p output with in-pass synchronized audio and multi-shot character consistency.

🎯

face-swap🎯Skill

A Claude Code skill for face and character swapping in images and videos on RunComfy, routing across Wan 2-2 Animate, GPT Image 2 Edit, Nano Banana Edit, Flux Kontext, and Kling Motion Control based on the target medium and swap type.