speech-build
π―Skillfrom cnemri/google-genai-skills
A skill for building speech applications using Google GenAI, part of a curated collection of agentic skills designed to work with Gemini CLI, Antigravity, Claude Code, and other AI assistants.
Same repository
cnemri/google-genai-skills(10 items)
Installation
npx vibeindex add cnemri/google-genai-skills --skill speech-buildnpx skills add cnemri/google-genai-skills --skill speech-build~/.claude/skills/speech-build/SKILL.mdSKILL.md
More from this repository9
Provides expert guidance and Python code examples for building, configuring, and deploying intelligent agents using the Google Agent Development Kit (ADK).
Generates and edits videos using Google's Veo AI models with text, image, and reference-based inputs across multiple creative modes.
Provides expert Python code guidance for leveraging Google's Gemini API with the official GenAI SDK, covering text, chat, multimodal, and generative AI tasks.
Generates and edits videos using Google's Veo AI models, supporting text-to-video, image-to-video, and advanced video manipulation techniques.
A skill for conducting autonomous, multi-step research using the Gemini Deep Research Agent via the Interactions API, supporting web search, file/directory context, and resilient streaming.
Curated agentic skills for Google AI frameworks and models, compatible with Gemini CLI, Antigravity, Claude Code, and other AI coding assistants.
A skill for speech operations using Google's GenAI and Cloud Speech SDKs, supporting text-to-speech with Gemini-TTS, speech-to-text transcription with Chirp 3, and instant voice cloning.
A curated collection of agentic skills for Google AI frameworks and models, designed to work with Gemini CLI, Antigravity, Claude Code, and more.
Skill for generating and editing images using Gemini 2.5 Flash and Gemini 3 Pro image models via the google-genai Python SDK, supporting text-to-image, style transfer, and virtual try-on.