🎯

media-processing

🎯Skill

from binhmuc/autobot-review

VibeIndex|
What it does

Processes multimedia files using FFmpeg, ImageMagick, and RMBG to convert, encode, resize, filter, and manipulate video, audio, and image files with advanced capabilities.

πŸ“¦

Part of

binhmuc/autobot-review(29 items)

media-processing

Installation

npm installInstall npm package
npm install -g rmbg-cli
πŸ“– Extracted from docs: binhmuc/autobot-review
13Installs
-
AddedFeb 4, 2026

Skill Details

SKILL.md

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

Overview

# Media Processing Skill

Process video, audio, and images using FFmpeg, ImageMagick, and RMBG CLI tools.

Tool Selection

| Task | Tool | Reason |

|------|------|--------|

| Video encoding/conversion | FFmpeg | Native codec support, streaming |

| Audio extraction/conversion | FFmpeg | Direct stream manipulation |

| Image resize/effects | ImageMagick | Optimized for still images |

| Background removal | RMBG | AI-powered, local processing |

| Batch images | ImageMagick | mogrify for in-place edits |

| Video thumbnails | FFmpeg | Frame extraction built-in |

| GIF creation | FFmpeg/ImageMagick | FFmpeg for video, ImageMagick for images |

Installation

```bash

# macOS

brew install ffmpeg imagemagick

npm install -g rmbg-cli

# Ubuntu/Debian

sudo apt-get install ffmpeg imagemagick

npm install -g rmbg-cli

# Verify

ffmpeg -version && magick -version && rmbg --version

```

Essential Commands

```bash

# Video: Convert/re-encode

ffmpeg -i input.mkv -c copy output.mp4

ffmpeg -i input.avi -c:v libx264 -crf 22 -c:a aac output.mp4

# Video: Extract audio

ffmpeg -i video.mp4 -vn -c:a copy audio.m4a

# Image: Convert/resize

magick input.png output.jpg

magick input.jpg -resize 800x600 output.jpg

# Image: Batch resize

mogrify -resize 800x -quality 85 *.jpg

# Background removal

rmbg input.jpg # Basic (modnet)

rmbg input.jpg -m briaai -o output.png # High quality

rmbg input.jpg -m u2netp -o output.png # Fast

```

Key Parameters

FFmpeg:

  • -c:v libx264 - H.264 codec
  • -crf 22 - Quality (0-51, lower=better)
  • -preset slow - Speed/compression balance
  • -c:a aac - Audio codec

ImageMagick:

  • 800x600 - Fit within (maintains aspect)
  • 800x600^ - Fill (may crop)
  • -quality 85 - JPEG quality
  • -strip - Remove metadata

RMBG:

  • -m briaai - High quality model
  • -m u2netp - Fast model
  • -r 4096 - Max resolution

References

Detailed guides in references/:

  • ffmpeg-encoding.md - Codecs, quality, hardware acceleration
  • ffmpeg-streaming.md - HLS/DASH, live streaming
  • ffmpeg-filters.md - Filters, complex filtergraphs
  • imagemagick-editing.md - Effects, transformations
  • imagemagick-batch.md - Batch processing, parallel ops
  • rmbg-background-removal.md - AI models, CLI usage
  • common-workflows.md - Video optimization, responsive images, GIF creation
  • troubleshooting.md - Error fixes, performance tips
  • format-compatibility.md - Format support, codec recommendations

More from this repository10

🎯
mobile-development🎯Skill

mobile-development skill from binhmuc/autobot-review

🎯
planning🎯Skill

planning skill from binhmuc/autobot-review

🎯
payment-integration🎯Skill

payment-integration skill from binhmuc/autobot-review

🎯
chrome-devtools🎯Skill

Automates browser tasks using Puppeteer, enabling web scraping, performance analysis, screenshots, and debugging with JSON output.

🎯
research🎯Skill

Systematically researches technical solutions by gathering multi-source information, analyzing content, and validating findings to provide scalable, secure, and maintainable recommendations.

🎯
ui-styling🎯Skill

Crafts beautiful, accessible user interfaces using shadcn/ui components, Tailwind CSS utility styling, and canvas-based visual design systems.

🎯
devops🎯Skill

Deploys and manages cloud infrastructure across Cloudflare, Docker, and Google Cloud Platform with comprehensive edge computing and containerization strategies.

🎯
shopify🎯Skill

Builds and deploys Shopify applications, extensions, and themes using GraphQL/REST APIs, Shopify CLI, and Liquid templating for comprehensive e-commerce platform customization.

🎯
repomix🎯Skill

Packages entire code repositories into single AI-friendly files with customizable filters, formats, and optimizations for LLM context.

🎯
databases🎯Skill

Guides developers in selecting and mastering MongoDB and PostgreSQL databases for optimal data management and performance.