kling-3-0
🎯Skillfrom doany-ai/skills
A Claude Code skill for generating video with Kuaishou Kling 3.0 on RunComfy, covering all six endpoints across three quality tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video) with native synchronized audio and character consistency.
Overview
Kling 3.0 is a Claude Code skill for generating cinematic video with Kuaishou Technology's third-generation video model on RunComfy. It covers all six Kling 3.0 rendering endpoints, spanning three quality tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video). Kling 3.0 produces multi-shot video with synchronized native audio, consistent character identity across shots, and physics-aware motion, with support for clips up to 15 seconds and native 4K output on the 4K tier.
Key Features
- Three quality tiers: Standard (cheapest, up to 1080p for fast iteration and social shorts), Pro (highest fidelity at 1080p for hero-quality clips), and 4K (native 3840x2160 for brand films and big-screen sequences)
- Dual input modes: Text-to-video generates from text prompts, while image-to-video animates a reference image into motion, both available across all three quality tiers
- Native synchronized audio: Generates dialogue, ambient sounds, and music in-pass, with optional audio generation that adds a per-second surcharge
- Multi-prompt segment system: A unified system that allows one generation to contain several distinct scenes with controlled transitions, enabling multi-shot narratives in a single call
- Physics-aware motion and character consistency: Maintains realistic motion physics and consistent character identity (face, wardrobe, build) across shots
Who is this for?
- Film and video production teams creating cinematic sequences, brand films, and big-screen content that requires up to native 4K resolution with synchronized audio
- Social media and advertising teams producing short-form video ads and social content across quality tiers, from rapid Standard-tier iteration to polished Pro-tier finals
- Creative directors and storyboard artists prototyping multi-shot narratives with consistent characters using the multi-prompt segment system
Same repository
doany-ai/skills(29 items)
Installation
npx vibeindex add doany-ai/skills --skill kling-3-0npx skills add doany-ai/skills --skill kling-3-0~/.claude/skills/kling-3-0/SKILL.mdSKILL.md
More from this repository10
A Claude Code skill for image outpainting on RunComfy that extends images beyond their original canvas, supporting aspect ratio changes, uncropping, and canvas expansion by routing across Nano Banana 2 Edit, GPT Image 2 Edit, and FLUX Kontext Pro.
A Claude Code skill for pose-conditioned image and video generation on RunComfy, routing across Kling 2-6 Motion Control (video motion transfer), Wan 2-2 Animate (audio-driven character animation), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation).
A Claude Code skill that creates AI avatar and talking-head videos on RunComfy, intelligently routing across OmniHuman, Wan 2-7, HappyHorse, and Seedance models based on user intent such as UGC voiceover, virtual presenter, or lip-synced character.
A Claude Code skill for generating images with Google Nano Banana 2, a Gemini-family flash-tier text-to-image model hosted on RunComfy, optimized for rapid iteration, social thumbnails, and in-image typography rendering.
Extend a video's spatial canvas on RunComfy — uncrop, change aspect ratio (e.g., 9:16 to 16:9), or add environment beyond the original frame while preserving the central action. Routes through Wan 2-7 edit-video and dedicated ComfyUI outpaint workflows.
A Claude Code skill that extends existing video clips on RunComfy using Google Veo 3-1's extend-video endpoints, continuing clips past their duration cap or chaining narrative shots while preserving consistent motion, lighting, and subject identity.
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy — studio-quality 44.1 kHz stereo audio from 5 seconds to 5 minutes with section-level control (Intro, Verse, Chorus, Bridge), multilingual vocals, and commercial-friendly output.
A Claude Code skill for generating and editing images with OpenAI GPT Image 2 (ChatGPT Images 2.0) hosted on RunComfy, with strengths in embedded text, logos, multilingual typography, and precise multi-element prompt following.
Animate any still image into video on RunComfy, routing to the best i2v model for each intent — HappyHorse 1.0 for general animations with native audio, Wan 2.7 with audio_url for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image plus reference video and audio.
A Claude Code skill that acts as a smart router for video editing on RunComfy, automatically selecting the best model -- Wan 2.7 Edit-Video for general restyle, Kling 2.6 Pro for motion transfer, or Lucy Edit Restyle for lightweight outfit/background swaps.