1 result for tag "minimax-multimodal-toolkit"
Unified entry point for MiniMax multimodal generation: TTS (text-to-speech, voice cloning, voice design, multi-segment), music (songs, instrumentals), video (text-to-video, image-to-video, start-end frame, subject reference, templates, long-form multi-scene), image (text-to-image, image-to-image with character reference), and FFmpeg-based media processing (convert/concat/trim/extract). Part of the MiniMax Skills suite for Claude Code, Cursor, Codex, and OpenCode.