8 results for tag "gemini-image"
Analyzes images using Gemini Pro's vision capabilities for tasks like text extraction (OCR) from screenshots, UI analysis, error diagnosis, diagram understanding, and general image description.
Generate images using Google Gemini and Imagen models via the scripts in scripts/. Use for AI image generation, text-to-image, creating visuals from prompts, generating multiple images, custom aspect ratios, and high-resolution output up to 4K. Triggers on "generate image", "create image", "imagen", "text to image", "AI art", "nano banana".