🎯

ai-multimodal

🎯Skill

from mrgoonie/claudekit-skills

What it does

Processes and generates multimedia content using Google Gemini API, including audio transcription, image analysis, video processing, and document extraction across multiple formats.

📦

Same repository

mrgoonie/claudekit-skills(33 items)

ai-multimodal

Installation

Vibe Index InstallInstalls to .claude/skills/

npx vibeindex add mrgoonie/claudekit-skills --skill ai-multimodal

skills.sh Install⚠ Installs to .agents/skills/

npx skills add mrgoonie/claudekit-skills --skill ai-multimodal

Manual InstallCopy SKILL.md content and save to the path below

~/.claude/skills/ai-multimodal/SKILL.md

SKILL.md

446Installs

1,629

Last UpdatedJan 21, 2026

View on GitHub Back to Skills

More from this repository10

🎯

backend-development🎯Skill

Guides building robust backend systems with modern technologies like Node.js, Python, Go, and Rust, covering API design, database integration, authentication, security best practices, and scalability patterns.

🎯

sequential-thinking🎯Skill

Enables structured problem-solving through iterative step-by-step reasoning with the ability to revise thoughts, branch into alternatives, and dynamically adjust scope.

🎯

threejs🎯Skill

Three.js development skill for building high-performance 3D web apps covering WebGL/WebGPU rendering, PBR materials, custom shaders, VR/XR, physics, and post-processing effects

🎯

devops🎯Skill

Deploys and manages cloud infrastructure across Cloudflare Workers, Docker, Google Cloud, and Kubernetes with CI/CD, GitOps, and security audit capabilities.

🎯

chrome-devtools🎯Skill

Automates browser interactions, debugging, and performance analysis using Puppeteer CLI scripts that output JSON for easy parsing.

🎯

aesthetic🎯Skill

Aesthetic design skill for creating beautiful interfaces following proven design principles, covering visual hierarchy, color theory, micro-interactions, and design system guidance with chrome-devtools and AI multimodal integration

🎯

ui-styling🎯Skill

UI styling skill for creating accessible user interfaces with shadcn/ui components (Radix UI + Tailwind), utility-first Tailwind CSS styling, canvas-based visual designs, and consistent theming including dark mode

🎯

problem-solving🎯Skill

Creative problem-solving skill collection with techniques for breaking through stuck points, including collision-zone thinking, inversion, pattern recognition, and simplification approaches

🎯

media-processing🎯Skill

Processes multimedia files with FFmpeg for video/audio encoding, conversion, and streaming, and ImageMagick for image manipulation, batch processing, and effects.

🎯

code-review🎯Skill

Enforces rigorous code review practices by systematically evaluating feedback, requesting reviews, and verifying technical correctness before claiming task completion.