🎯

gemini-image-generator

🎯Skill

from mkdev-me/claude-skills

What it does

Generates high-quality images from text prompts or reference images using Google Gemini's AI, with customizable sizes and styles.

gemini-image-generator

Installation

git cloneClone repository

git clone https://github.com/mkdev-me/claude-skills.git

PythonRun Python server

python3 -m venv venv

📖 Extracted from docs: mkdev-me/claude-skills

Need more details? View full documentation on GitHub →

6Installs

AddedFeb 4, 2026

View on GitHub Back to Skills

Skill Details

SKILL.md

Generate images using Google Gemini with customizable options

Overview

# gemini-image-generator

Instructions

Use this skill to generate images using Google Gemini's image generation model. The skill supports:

Text-to-image generation from prompts
Image-to-image generation with a reference image
Multiple output sizes (1K, 2K, 4K)
Custom output paths

The API key must be set via the GEMINI_API_KEY environment variable.

Parameters

--prompt (required): The text prompt describing the image to generate
--output (required): Output file path for the generated image
--reference: Optional reference image for style/content guidance
--size: Image size - "1K", "2K", or "4K" (default: 4K)

Examples

Basic text-to-image generation

```bash

./scripts/generate.py --prompt "A serene mountain landscape at sunset" --output images/landscape.png

```

With reference image for style guidance

```bash

./scripts/generate.py --prompt "Same character but wearing a party hat" --reference images/character.png --output images/party.png

```

Different output size

```bash

./scripts/generate.py --prompt "Abstract art" --output art.png --size 2K

```

Setup

Before first use, set up the virtual environment:

```bash

cd scripts && python3 -m venv venv && ./venv/bin/pip install -r requirements.txt

```

Set your API key:

```bash

export GEMINI_API_KEY="your-api-key-here"

```