π―Skills5
π―gemini-ttsπ―Skill
Generates natural-sounding speech from text using Google Gemini TTS models, supporting multiple voices, streaming, and multi-speaker conversations.
gemini-tts
π―gemini-batchπ―Skill
Efficiently process large volumes of AI requests using Gemini Batch API, enabling cost-effective bulk text generation and async job execution via scripts.
gemini-batch
π―gemini-embeddingsπ―Skill
Generates high-quality text embeddings using Gemini API for semantic search, similarity analysis, clustering, and RAG applications.
gemini-embeddings
π―gemini-imageπ―Skill
Generates high-quality AI images from text prompts using Google's Gemini and Imagen models, supporting multiple resolutions, aspect ratios, and creative styles.
gemini-image
π―gemini-textπ―Skill
Generates text content using Google Gemini models with advanced capabilities like multimodal prompts, thinking mode, JSON output, and search grounding.
gemini-text