video-caption-creation
🎯 Skill from cdeistopened/opened-vault
Generates optimized, attention-grabbing captions and on-screen text hooks for short-form video platforms using a strategic Triple Word Score algorithm.
Installation
npx skills add https://github.com/cdeistopened/opened-vault --skill video-caption-creation
Skill Details
Create optimized captions and hooks for short-form video (TikTok, Reels, Shorts) using proven hook categories and the platform-specific Triple Word Score optimization system.
Overview
# Video Caption Writer
Purpose
This skill creates algorithm-optimized captions and on-screen text for short-form video content using the "Triple Word Score" system to maximize reach and engagement.
Core Philosophy: Content serves the topic first. When you serve the topic correctly, algorithms will serve your content to people interested in that topic.
When to Use This Skill
Use this skill when you need to:
- Create on-screen text hooks for video clips (3-5 options per clip)
- Write captions for short-form video (TikTok, Instagram, YouTube Shorts, etc.)
- Optimize video for algorithm discovery (topic identification in first 10 seconds)
- Generate strategic hashtag sets (10-12, broad to specific)
Do NOT use for:
- Text-only social posts (use `text-content` skill instead)
- Long-form YouTube videos (separate workflow)
- Podcast show notes (use `podcast-production` skill)
---
The Triple Word Score System
Algorithms and humans need to immediately understand your content through FOUR signals:
1. Audio Transcript (MOST IMPORTANT)
- What you say out loud - algorithms auto-transcribe this
- Explicitly say topic words in first 10 seconds
- Repeat core terminology throughout
- Use natural speech patterns with topic-relevant vocabulary
2. On-Screen Text Reinforcement
- Visual overlay that reinforces spoken words
- Lead with topic words in first text overlay
- Use bold, clear terminology
- Don't compete with audio - reinforce it
3. Caption Copy Optimization
- Post description with topic-relevant keywords
- Open with topic-relevant phrases in first sentence
- Include natural keyword variations
- Provide context for algorithm understanding
4. Strategic Hashtag System
- 10-12 hashtags (optimal range)
- Mix broad and specific (#Education + #LearningStyles)
- Include audience-specific (#HomeschoolParents)
- Span from general to niche
When all four signals align, algorithms AND humans immediately recognize: "This is for me."
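The alignment check can be sketched in a few lines of Python. This is a minimal illustration, not part of the skill itself: the function name `missing_topic_words`, its parameters, and the sample strings are all assumptions made for the example. It reports which topic words are absent from each of the four signals.

```python
import re


def missing_topic_words(topic_words, opening_audio, on_screen_text, caption, hashtags):
    """Return, per signal, the topic words that do not appear in it.

    Hypothetical helper: the name and signature are illustrative only.
    """
    def tokens(text):
        # Lowercase word tokens; punctuation and emojis are dropped.
        return set(re.findall(r"[a-z0-9']+", text.lower()))

    signals = {
        "audio": tokens(opening_audio),      # transcript of the first 10 seconds
        "on_screen": tokens(on_screen_text),
        "caption": tokens(caption),
    }
    report = {
        name: [w for w in topic_words if w.lower() not in found]
        for name, found in signals.items()
    }
    # Hashtags run words together (#HomeschoolParents), so match by substring.
    hashtag_blob = " ".join(hashtags).lower()
    report["hashtags"] = [w for w in topic_words if w.lower() not in hashtag_blob]
    return report


# Made-up sample data for one clip.
report = missing_topic_words(
    topic_words=["minecraft", "learning"],
    opening_audio="Your kid's Minecraft habit is real learning in disguise.",
    on_screen_text="Your kid's Minecraft addiction is actually genius",
    caption="Minecraft teaches problem-solving. Schools teach compliance.",
    hashtags=["#Education", "#Minecraft", "#LearningThroughPlay"],
)
print(report)
```

An empty list for every signal means all four are aligned on the topic words. Exact word matching is deliberately crude here; it will miss variations such as "learn" versus "learning".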
---
The 3-Step Workflow
Step 1: Generate On-Screen Text Hooks
Goal: Create 3-5 hook options that stop the scroll in first 3 seconds
#### The McDonald's Test
> "Someone who works at McDonald's should be able to read and understand your headline instantly."
- Avoid fancy vocabulary ("quadratic formula" vs "stop raising entitled kids")
- Choose accessible language over impressive terminology
- Wider net = better performance
#### Hook Categories
1. Polarizing Statements
- "Your kid's [negative thing] is actually genius"
- "Stop raising [undesirable trait] kids"
- "[Institution] operates like prisons"
2. Counter-Intuitive Reveals
- "My son's first job sucked. Perfect."
- "The worst experiences are the best teachers"
- "Every kid has genius hiding somewhere"
3. Direct Challenges
- "Never give up on the weird kid"
- "Don't judge the trench coat kid"
- "Make your kids love learning again"
4. Curiosity Gaps
- "Her boyfriend freaked out..." (for dramatic content)
- "The third tip will blow your mind"
- Use carefully - can create confusion if unclear
#### Generate 3-5 Options
For each video clip, create file: [Clip Name]_Captions.md
Format:
```markdown
# [Clip Name] - Caption Options
Source Material
[Brief description of video content, key quote, or main message]
On-Screen Text Hook Options (First 3 Seconds):
Option 1: Your kid's Minecraft addiction is actually genius
- Framework: Polarizing Statement
Option 2: Stop trying to limit screen time. Try this instead.
- Framework: Direct Challenge
Option 3: The one thing schools get wrong about boys
- Framework: Curiosity Gap
Option 4: Schools teach compliance. Minecraft teaches problem-solving.
- Framework: Contrast/Counter-Intuitive
Option 5: 3 reasons your kid's "screen addiction" might be genius
- Framework: Numbered List + Counter-Intuitive
RECOMMENDED: Option 1 (Polarizing) for Short-Form, Option 4 (Contrast) for LinkedIn
```
#### Quality Check for Hooks
- [ ] Would this make ME stop scrolling?
- [ ] Creates curiosity or emotion in 2 seconds?
- [ ] Can my grandmother understand instantly?
- [ ] Hints at payoff without giving it away?
- [ ] Passes McDonald's Test (accessible language)?
---
Step 2: Create Platform-Specific Captions
Goal: Write optimized captions for each target platform
Short-Form Video Platforms (Identical Treatment)
Applies to: YouTube Shorts, Instagram Reels, TikTok, Facebook Reels
- Character Limit: 2,200 (50-150 characters is optimal)
- Hashtags: 3-5 (include platform tag: #Shorts for YouTube)
- Tone: Casual, energetic, hook in first few words
- Example: "Your kid's Minecraft addiction is actually genius 🎮 #Education #Parenting #Gaming"
Key: Hook immediately, use emojis, keep text scannable
---
LinkedIn (Separate Strategy)
Platform-Specific Guidelines:
- Character Limit: 3,000 (150-300 characters is optimal)
- Hashtags: 3-5
- Tone: Professional yet conversational, value-driven
- Format: Hook → insight → CTA (save, comment, share)
- Example: "Your kid's 'screen addiction' might be teaching them more than traditional school. Minecraft builds problem-solving, creativity, and systems thinking. #Education #Parenting #21stCenturyLearning"
Note: For LinkedIn posts, use the text-content skill instead of this skill.
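The limits listed above lend themselves to an automated check. The following is a rough sketch rather than an official validator: the `PlatformRule` class, the `RULES` table keys, and `check_caption` are hypothetical names, and it assumes the optimal length counts the whole caption, hashtags included. The numbers themselves come from the guidelines above.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class PlatformRule:
    char_limit: int
    optimal_chars: tuple[int, int]   # (min, max) recommended length
    hashtags: tuple[int, int]        # (min, max) hashtag count


# Values taken from the guidelines above; the dictionary keys are illustrative.
RULES = {
    "short_form": PlatformRule(2200, (50, 150), (3, 5)),
    "linkedin": PlatformRule(3000, (150, 300), (3, 5)),
}


def check_caption(caption: str, platform: str) -> list[str]:
    """Return a list of problems; an empty list means the caption passes."""
    rule = RULES[platform]
    problems = []
    length = len(caption)
    tag_count = len(re.findall(r"#\w+", caption))

    if length > rule.char_limit:
        problems.append(f"over the {rule.char_limit}-character limit ({length})")
    min_len, max_len = rule.optimal_chars
    if not min_len <= length <= max_len:
        problems.append(f"length {length} is outside the optimal {min_len}-{max_len} range")
    min_tags, max_tags = rule.hashtags
    if not min_tags <= tag_count <= max_tags:
        problems.append(f"{tag_count} hashtags, expected {min_tags}-{max_tags}")
    return problems


example = "Your kid's Minecraft addiction is actually genius 🎮 #Education #Parenting #Gaming"
print(check_caption(example, "short_form"))   # passes: no problems reported
print(check_caption("Watch this", "short_form"))
```

Returning a list of problems instead of raising on the first failure lets one pass surface every issue with a caption.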
#### Platform-Specific Caption Template
```markdown
Short-Form Video Captions
YouTube Shorts / Instagram Reels / TikTok / Facebook Reels
Your kid's Minecraft addiction is actually genius 🎮
Schools teach compliance. Minecraft teaches creation.
#Education #Parenting #Minecraft #Gaming
---
LinkedIn (use the text-content skill)
Your kid's "screen addiction" might be teaching them more than traditional school.
Minecraft isn't just a game: it's problem-solving, creativity, and systems thinking in action. Schools teach compliance. Minecraft teaches creation.
#Education #Parenting #21stCenturyLearning
```
Reference for detailed platform guidelines: references/platform-guidelines.md
---
Step 3: Apply Algorithm Optimization
Goal: Ensure content is discoverable and serves the right audience
#### Topic Identification Checklist
First 10 Seconds (Critical):
- [ ] Topic words spoken explicitly in audio
- [ ] On-screen text reinforces topic words
- [ ] Visual elements support topic (environment, materials, context)
- [ ] Energy/delivery matches topic expectations
Throughout Video:
- [ ] Core terminology repeated naturally
- [ ] Topic focus maintained (not scattered across multiple topics)
- [ ] Value delivered on topic promise
Caption & Metadata:
- [ ] Caption opens with topic-relevant phrase
- [ ] Natural keyword variations included
- [ ] Hashtags span broad to specific (10-12 total)
- [ ] Description provides context for algorithm
#### Strategic Hashtag System
Formula: Broad → Mid → Specific → Niche
Example for education content:
Broad (1-2): #Education #Parenting
Mid (2-3): #Homeschool #AlternativeEducation #LearningDifferences
Specific (3-4): #Homeschooling #Microschools #Unschooling #ProjectBasedLearning
Niche (2-3): #OpenEducation #HomeschoolLife #LearningThroughPlay
Audience (1-2): #HomeschoolMom #HomeschoolParents
Platform (1): #Shorts (for YouTube), #Reels (for Instagram)
Total: 10-12 hashtags
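The tier counts and the 10-12 total can be checked mechanically. The sketch below is only an illustration: `TIER_RANGES`, `TOTAL_RANGE`, and `check_hashtag_set` are hypothetical names, and the duplicate check is an added assumption rather than a stated rule. The tier maxima add up to 15, so a set that fills every tier to its maximum overshoots the total and has to be trimmed.

```python
# Per-tier (min, max) counts from the formula above.
TIER_RANGES = {
    "broad": (1, 2),
    "mid": (2, 3),
    "specific": (3, 4),
    "niche": (2, 3),
    "audience": (1, 2),
    "platform": (1, 1),
}
TOTAL_RANGE = (10, 12)


def check_hashtag_set(tiers: dict[str, list[str]]) -> list[str]:
    """Return a list of problems; an empty list means the set fits the formula."""
    problems = []
    for tier, (low, high) in TIER_RANGES.items():
        count = len(tiers.get(tier, []))
        if not low <= count <= high:
            problems.append(f"{tier}: {count} tags, expected {low}-{high}")

    all_tags = [tag.lower() for tags in tiers.values() for tag in tags]
    # Assumption: the same tag should not be spent in two tiers.
    if len(set(all_tags)) != len(all_tags):
        problems.append("duplicate hashtags across tiers")

    low, high = TOTAL_RANGE
    if not low <= len(all_tags) <= high:
        problems.append(f"total: {len(all_tags)} tags, expected {low}-{high}")
    return problems


# A trimmed version of the education example: 2 + 2 + 3 + 2 + 1 + 1 = 11 tags.
trimmed = {
    "broad": ["#Education", "#Parenting"],
    "mid": ["#Homeschool", "#AlternativeEducation"],
    "specific": ["#Homeschooling", "#Microschools", "#Unschooling"],
    "niche": ["#OpenEducation", "#LearningThroughPlay"],
    "audience": ["#HomeschoolParents"],
    "platform": ["#Shorts"],
}
print(check_hashtag_set(trimmed))
```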
#### Common Mistakes to Avoid
❌ Misleading Signal Trap
- Topic says "education" but content is really entertainment
- ✅ Fix: Topic signals match actual content value
❌ Buried Lede
- Topic words don't appear until middle of content
- ✅ Fix: Topic identification happens in first 10 seconds
❌ Scatter Shot
- Trying to cover multiple topics in one piece
- ✅ Fix: Laser focus on one clear topic per piece
❌ Brand First Approach
- Making content about your brand/company
- ✅ Fix: Making content about the topic, featuring your expertise
---
Output Format
Standard Output File: `[Clip Name]_Captions.md`
```markdown
# [Clip Name] - Captions & Optimization
Video Details
Duration: [XX seconds]
Core Message: [1-2 sentence summary]
Topic: [Primary topic for algorithm]
Target Audience: [Who this serves]
---
On-Screen Text Options (Choose 1)
Option 1: [Text]
- Category: [Hook category]
- Platform fit: [Platforms]
Option 2: [Text]
- Category: [Hook category]
- Platform fit: [Platforms]
Option 3: [Text]
- Category: [Hook category]
- Platform fit: [Platforms]
RECOMMENDED: [Which option and why]
---
Platform-Specific Captions
TikTok
[Caption text with emojis]
[Hashtags: 3-5]
---
Instagram Reels
[Caption text with emojis]
[Hashtags: 5-10]
---
YouTube Shorts
[Caption text]
[Hashtags: 3-5 including #Shorts]
---
LinkedIn
[Caption text - more professional tone]
[Hashtags: 3-5]
---
Facebook Reels
[Caption text with emojis]
[Hashtags: 2-5]
---
Algorithm Optimization
Topic Keywords (spoken in first 10 seconds): [List keywords]
Secondary Keywords (throughout): [List keywords]
Hashtag Set (10-12 total):
- Broad: [2 tags]
- Mid: [3 tags]
- Specific: [4 tags]
- Niche: [3 tags]
First 10 Second Audio Script:
"[Exact words that will be spoken - must include topic keywords]"
---
Quality Checks
- [ ] McDonald's Test passed (accessible language)
- [ ] Topic clear in first 3 seconds
- [ ] On-screen text reinforces (not competes with) audio
- [ ] Platform character limits followed
- [ ] Hashtag counts appropriate per platform
- [ ] Triple Word Score complete (audio, on-screen, caption, hashtags)
```
---
Bundled Resources
This skill is self-contained. All guidance is inline above.
Cross-References
- `text-content/references/platforms/` - Platform-specific guidelines for text posts
- `short-form-video/` - Production workflow including FORMAT_INVENTORY.md
---
Success Metrics
A successful video caption:
✅ Stops the scroll - Hook creates curiosity/emotion in first 3 seconds
✅ Passes McDonald's Test - Accessible to broad audience
✅ Triple Word Score complete - Audio, on-screen, caption, hashtags all aligned
✅ Topic-first - Algorithm knows what this is about immediately
✅ Platform-optimized - Character limits, hashtag counts, voice appropriate
✅ Discoverable - Strategic hashtags span broad to niche
---
Related Skills
- `text-content` - For text-only social posts (LinkedIn, X/Twitter, Facebook)
- `youtube-clip-extractor` - Complete workflow from YouTube URL to publishable clips
- `opened-daily-newsletter-writer` - Newsletters often generate video clip concepts
- `short-form-video` - Full short-form video production workflow
---
Common Pitfalls to Avoid
Content Issues:
❌ Jargon or fancy vocabulary in hooks
❌ Giving away the payoff in the hook
❌ On-screen text competes with (instead of reinforces) audio
❌ Generic hooks everyone uses
Algorithm Issues:
❌ Topic words not spoken in first 10 seconds
❌ Too many topics in one video
❌ Hashtags all broad or all niche (not spanning range)
❌ Misleading hooks that don't match content
Platform Issues:
❌ Same caption copy across all platforms
❌ Too many hashtags (LinkedIn, X) or too few (Instagram)
❌ Wrong tone for platform (too casual for LinkedIn, too formal for TikTok)
❌ External links where they hurt reach (Facebook main post)
---
Version History
- v1.0 (2025-10-27): Initial skill creation
  - Triple Word Score system
  - 4 hook categories
  - Platform-specific guidelines for 6 platforms
  - Algorithm optimization checklist
---
For platform updates or new hook categories, add them to the references folder and note the change in the version history.