yt-transcribe
Skill from mhagrelius/dotfiles
Transcribes YouTube video content, extracting spoken text from videos for quick understanding and analysis.
Installation
npx skills add https://github.com/mhagrelius/dotfiles --skill yt-transcribe
Skill Details
Use when user asks about YouTube video content, wants to know what a video says, needs information from a YouTube URL, or when video transcription would answer their question
# YouTube Video Transcription
Overview
Use yt-transcribe to get the actual spoken content from YouTube videos. Web search and fetch tools cannot access video content - they only see metadata.
When to Use
Use this skill when:
- User shares a YouTube URL and asks what it says/contains
- User wants information that likely exists in a video
- User asks "what does this video talk about?"
- You need to extract spoken content from YouTube
Do NOT use:
- For video metadata only (title, description) - web search is faster
- When user explicitly wants just the link, not content
Quick Reference
| Flag | Purpose |
|------|---------|
| -q | Required for LLMs - clean stdout, no progress noise |
| -m tiny | Faster transcription (less accurate) |
| -m medium | More accurate (slower, larger model) |
| -o file | Save to file instead of stdout |
| -f srt/vtt | Include timestamps |
Core Pattern
```bash
# Standard usage for LLM consumption
yt-transcribe "https://youtube.com/watch?v=VIDEO_ID" -q
# Faster but less accurate
yt-transcribe "https://youtube.com/watch?v=VIDEO_ID" -q -m tiny
# With timestamps
yt-transcribe "https://youtube.com/watch?v=VIDEO_ID" -q -f srt
```
Always use the -q flag: it suppresses progress output and writes a clean transcript to stdout.
Workflow
- User provides YouTube URL or asks about video content
- Run: yt-transcribe "URL" -q
- Read and summarize the transcript for the user
- Answer their specific question using the transcript content
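For the last step, a small filter can help locate the part of a transcript that bears on the user's question. The sketch below is illustrative only: `find_in_transcript` is a hypothetical helper, not part of yt-transcribe, and it assumes the transcript arrives on stdin with line breaks (for example when timestamps are requested).

```bash
# Hypothetical helper (not part of yt-transcribe): read a transcript on stdin
# and print the lines that mention a keyword, case-insensitively.
find_in_transcript() {
  local keyword="$1"
  # -F treats the keyword as a literal string; -- guards keywords starting with "-".
  grep -i -F -- "$keyword"
}

# Usage (commented out so the snippet can be sourced without the tool installed):
# yt-transcribe "https://youtube.com/watch?v=VIDEO_ID" -q | find_in_transcript "pricing"
```

Note that `grep` exits with status 1 when nothing matches, so a non-zero status here does not necessarily mean the transcription failed.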
Common Mistakes
| Mistake | Fix |
|---------|-----|
| Trying WebFetch on YouTube URLs | YouTube blocks bots, use yt-transcribe |
| Forgetting -q flag | Progress output pollutes response, always use -q |
| Web searching for "what does video say" | Search finds metadata, not content - transcribe instead |
| Using without setup | Run yt-transcribe --setup first if binaries missing (exit code 5) |
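The setup fix in the last row can be sketched as a wrapper that runs setup once and retries. This is a minimal sketch under assumptions: `transcribe_with_setup` and the `YT_TRANSCRIBE` override variable are hypothetical names introduced here, and it assumes `yt-transcribe --setup` can run unattended.

```bash
# Hypothetical wrapper: run yt-transcribe quietly and, if it reports missing
# binaries (exit code 5), run --setup once and retry.
# YT_TRANSCRIBE is an assumed override so the command can be swapped out.
transcribe_with_setup() {
  local url="$1"
  local cmd="${YT_TRANSCRIBE:-yt-transcribe}"
  local status=0

  "$cmd" "$url" -q || status=$?
  if [ "$status" -eq 5 ]; then
    # Setup failed: pass its exit code back to the caller.
    "$cmd" --setup || return $?
    status=0
    "$cmd" "$url" -q || status=$?
  fi
  return "$status"
}

# Usage:
# transcribe_with_setup "https://youtube.com/watch?v=VIDEO_ID" > transcript.txt
```

Retrying only once keeps the wrapper from looping if setup succeeds but the binaries are still unavailable.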
Exit Codes
- 0 - Success
- 5 - Binary not found (run --setup)
- 3 - Network error
- 4 - Transcription error
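A caller might translate these codes into messages before reporting a failure. The following is a sketch, not part of the tool: `explain_exit_code` is a hypothetical helper, and it covers only the four codes listed above, treating anything else as undocumented.

```bash
# Hypothetical helper: map a yt-transcribe exit code to a short description.
# Only the four documented codes are covered; others are reported as undocumented.
explain_exit_code() {
  case "$1" in
    0) echo "success" ;;
    3) echo "network error" ;;
    4) echo "transcription error" ;;
    5) echo "binary not found: run yt-transcribe --setup" ;;
    *) echo "undocumented exit code $1" ;;
  esac
}

# Usage (commented out so the snippet can be sourced without the tool installed):
# yt-transcribe "https://youtube.com/watch?v=VIDEO_ID" -q > transcript.txt
# status=$?
# [ "$status" -eq 0 ] || echo "yt-transcribe failed: $(explain_exit_code "$status")" >&2
```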