nano-banana-pro
Skill from doggy8088/agent-skills
Generates and edits images using Google's Nano Banana Pro API, supporting text-to-image creation and image modification with configurable resolutions.
Part of doggy8088/agent-skills (20 items)
Installation
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "your image description" --filename "output-name.png" [--resolution 1K|2K|4K] [--api-key KEY]
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "editing instructions" --filename "output-name.png" --input-image "path/to/input.png" [--resolution 1K|2K|4K] [--api-key KEY]
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "A serene Japanese garden with cherry blossoms" --filename "2025-11-23-14-23-05-japanese-garden.png" --resolution 4K
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "make the sky more dramatic with storm clouds" --filename "2025-11-23-14-25-30-dramatic-sky.png" --input-image "original-photo.jpg" --resolution 2K
Skill Details
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when the user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K by default, or 2K or 4K for higher resolution). DO NOT read the image file first; use this skill directly with the --input-image parameter.
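The two invocation modes described above (generation, and editing via --input-image) can be assembled programmatically. The following is a minimal sketch, not part of the skill itself. The helper names timestamped_filename and build_command are hypothetical, the timestamp prefix is only inferred from the example filenames, and the 1K default mirrors the description. The optional --api-key flag is left out.

```python
from datetime import datetime
from pathlib import Path
from typing import List, Optional

# Script path as shown in the usage lines above.
SCRIPT = "~/.claude/skills/nano-banana-pro/scripts/generate_image.py"
RESOLUTIONS = ("1K", "2K", "4K")


def timestamped_filename(name: str, now: Optional[datetime] = None) -> str:
    """Hypothetical helper: prefix a name with yyyy-mm-dd-hh-mm-ss, matching
    the pattern of the example filenames (an inferred convention)."""
    now = now or datetime.now()
    return f"{now.strftime('%Y-%m-%d-%H-%M-%S')}-{name}.png"


def build_command(
    prompt: str,
    filename: str,
    resolution: str = "1K",
    input_image: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: build the argv list for one run of the script.

    Passing input_image switches from text-to-image generation to editing.
    """
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")
    cmd = [
        "uv", "run", str(Path(SCRIPT).expanduser()),
        "--prompt", prompt,
        "--filename", filename,
        "--resolution", resolution,
    ]
    if input_image is not None:
        cmd += ["--input-image", input_image]
    return cmd
```

The resulting list can be handed to subprocess.run(cmd, check=True). Building an argv list rather than a single shell string avoids quoting problems when the prompt contains quotes or spaces.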
More from this repository (10)
Streamlines AWS Copilot CLI interactions, simplifying container application deployment and management on Amazon ECS and Fargate.
Converts diverse file types like PDF, DOCX, images, and web content to clean, structured Markdown for easy LLM processing.
Transforms natural language prompts into structured, executable code snippets across multiple programming languages.
Generates or edits images using OpenAI's GPT Image 1.5 model, supporting text-to-image creation and precise image modifications with optional masking.
Interacts with Todoist REST API v2 using curl and jq, enabling authenticated CRUD operations on tasks, projects, sections, labels, and comments with built-in safety confirmations.
beautiful-mermaid (no description available).
Retrieves up-to-date documentation for software libraries and frameworks by querying the Context7 API with library names and specific topics.
Renders a vibrant, multicolored ANSI art display of the word "ultrathink" in stylized block letters.
Enables comprehensive interaction with Notion workspaces through direct REST API calls, covering authentication, endpoints, and data manipulation.
Enables fast, script-friendly command-line interactions with Google Workspace services like Gmail, Calendar, Drive, and more.