gemini-image
π―Skillfrom akrindev/google-studio-skills
Generates high-quality AI images from text prompts using Google's Gemini and Imagen models, supporting multiple resolutions, aspect ratios, and creative styles.
Part of
akrindev/google-studio-skills(5 items)
Installation
python scripts/generate_image.py "A futuristic city at sunset with flying cars"python scripts/generate_image.py "Minimalist coffee shop interior" --aspect 1:1 --size 2K --name coffee-shoppython scripts/generate_image.py "Tech gadget review thumbnail with vibrant colors" --aspect 16:9 --size 2K --name thumbnailpython scripts/generate_image.py "Abstract geometric patterns in blue and gold" --num 4 --name abstractpython scripts/generate_image.py "Detailed architectural rendering of modern museum" --aspect 16:9 --size 4K --output-dir ./professional/ --name museum+ 14 more commands
Skill Details
Generate images using Google Gemini and Imagen models via scripts/. Use for AI image generation, text-to-image, creating visuals from prompts, generating multiple images, custom aspect ratios, and high-resolution output up to 4K. Triggers on "generate image", "create image", "imagen", "text to image", "AI art", "nano banana".
More from this repository4
Generates natural-sounding speech from text using Google Gemini TTS models, supporting multiple voices, streaming, and multi-speaker conversations.
Efficiently process large volumes of AI requests using Gemini Batch API, enabling cost-effective bulk text generation and async job execution via scripts.
Generates high-quality text embeddings using Gemini API for semantic search, similarity analysis, clustering, and RAG applications.
Generates text content using Google Gemini models with advanced capabilities like multimodal prompts, thinking mode, JSON output, and search grounding.