nano-banana-pro

Name: nano-banana-pro
Author: intellectronica

AI image generationimage editingGemini 3 ProGoogle APIcreative AIdesign toolvisual contenttext-to-image

⭐ 273📄 CC0-1.0🕒 2026-04-25Source ↗

Install this skill

npx skills add intellectronica/agent-skills

Works across Claude Code, Cursor, Codex, Copilot & Antigravity

The Nano Banana Pro skill facilitates image synthesis and manipulation via the Gemini 3 Pro Image API. It operates directly from your active directory, ensuring all generated PNG assets remain local to your project workspace. The tool accepts natural language prompts to create original visuals from scratch or apply modifications to existing local image files. By standardizing filename conventions with embedded timestamps and mapping user language to resolution presets (1K, 2K, 4K), it simplifies the image creation pipeline for developers and designers. The tool requires a valid API key, either passed as a command argument or set via environmental variables, and handles execution through the uv package manager. This implementation is optimized for rapid iteration without requiring manual file management or external platform uploads.

When to Use This Skill

•Creating placeholder assets for web or mobile UI prototypes
•Modifying existing illustrations to match a specific art style
•Generating custom social media banners from textual descriptions
•Iterative prototyping of visual concepts during design brainstorming

How to Invoke This Skill

Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:

“Generate a 4K image of a futuristic city skyline
“Create a 2K image of a minimalist desk setup
“Edit this photo to make the background look like a watercolor painting
“Use the Nano Banana tool to generate a new graphic for the landing page
“Add storm clouds to this existing mountain photo

Pro Tips

💡Experiment with detailed and descriptive prompts for image generation to achieve precise results, specifying styles, colors, and content.
💡When editing, provide clear and concise instructions, referencing specific elements in the input image for accurate modifications.
💡Utilize the `4K` resolution option for high-fidelity outputs, especially for final assets or prints, but be mindful of processing time for larger resolutions.

What this skill does

•Generates high-resolution PNG images from text prompts
•Performs stylistic or content-based edits on local image files
•Supports 1K, 2K, and 4K output resolution toggles
•Applies timestamped, descriptive naming conventions to output files
•Provides direct execution via uv without path-switching overhead

When not to use it

✕Tasks requiring real-time video generation or animation
✕Scenarios needing extreme image upscaling beyond 4K parameters
✕Projects that require image analysis or metadata extraction instead of generation

Example workflow

User identifies a need for a new asset in the current directory
User executes the generation script with a specific descriptive prompt
Agent selects the appropriate resolution based on the user's request
Script generates the file and saves it with a timestamped filename
Agent confirms the saved file path back to the user
User proceeds to integrate the saved image into their project

Prerequisites

–Valid Gemini API key
–uv package manager installed
–Read/write permissions for the current working directory

Pitfalls & limitations

!Failing to provide an input image path during an edit request
!Over-describing filenames leading to long or truncated labels
!Environment variable conflicts if both a key argument and system variable exist

FAQ

How does the tool choose the output resolution?

It maps specific keywords to resolution settings. 'High-res' or '4K' triggers 4K, 'medium' or '2K' triggers 2K, while anything else defaults to 1K.

Do I need to change my directory to the skill folder to run this?

No. The tool is designed to run from your project directory using an absolute path to ensure images are saved exactly where you are working.

What happens if I don't provide an API key?

The script will look for a GEMINI_API_KEY environment variable. If neither the variable nor the argument is detected, the script will exit with an error.

How it compares

Unlike manual web-based generation, this tool automates file naming, directory placement, and parameter mapping, allowing for direct asset integration into your local code environment.

Source & trust

⭐ 273 stars📄 CC0-1.0🕒 Updated 2026-04-25

View original skill on GitHub →

📄 Full skill instructions — original source: intellectronica/agent-skills

# Nano Banana Pro Image Generation & Editing

Generate new images or edit existing ones using Google's Nano Banana Pro API (Gemini 3 Pro Image).

## Usage

Run the script using absolute path (do NOT cd to skill directory first):

**Generate new image:**

uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "your image description" --filename "output-name.png" [--resolution 1K|2K|4K] [--api-key KEY]

**Edit existing image:**

uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "editing instructions" --filename "output-name.png" --input-image "path/to/input.png" [--resolution 1K|2K|4K] [--api-key KEY]

**Important:** Always run from the user's current working directory so images are saved where the user is working, not in the skill directory.

## Resolution Options

The Gemini 3 Pro Image API supports three resolutions (uppercase K required):

- **1K** (default) - ~1024px resolution
- **2K** - ~2048px resolution
- **4K** - ~4096px resolution

Map user requests to API parameters:
- No mention of resolution → 1K
- "low resolution", "1080", "1080p", "1K" → 1K
- "2K", "2048", "normal", "medium resolution" → 2K
- "high resolution", "high-res", "hi-res", "4K", "ultra" → 4K

## API Key

The script checks for API key in this order:
1. --api-key argument (use if user provided key in chat)
2. GEMINI_API_KEY environment variable

If neither is available, the script exits with an error message.

## Filename Generation

Generate filenames with the pattern: yyyy-mm-dd-hh-mm-ss-name.png

**Format:** {timestamp}-{descriptive-name}.png
- Timestamp: Current date/time in format yyyy-mm-dd-hh-mm-ss (24-hour format)
- Name: Descriptive lowercase text with hyphens
- Keep the descriptive part concise (1-5 words typically)
- Use context from user's prompt or conversation
- If unclear, use random identifier (e.g., x9k2, a7b3)

Examples:
- Prompt "A serene Japanese garden" → 2025-11-23-14-23-05-japanese-garden.png
- Prompt "sunset over mountains" → 2025-11-23-15-30-12-sunset-mountains.png
- Prompt "create an image of a robot" → 2025-11-23-16-45-33-robot.png
- Unclear context → 2025-11-23-17-12-48-x9k2.png

## Image Editing

When the user wants to modify an existing image:
1. Check if they provide an image path or reference an image in the current directory
2. Use --input-image parameter with the path to the image
3. The prompt should contain editing instructions (e.g., "make the sky more dramatic", "remove the person", "change to cartoon style")
4. Common editing tasks: add/remove elements, change style, adjust colors, blur background, etc.

## Prompt Handling

**For generation:** Pass user's image description as-is to --prompt. Only rework if clearly insufficient.

**For editing:** Pass editing instructions in --prompt (e.g., "add a rainbow in the sky", "make it look like a watercolor painting")

Preserve user's creative intent in both cases.

## Output

- Saves PNG to current directory (or specified path if filename includes directory)
- Script outputs the full path to the generated image
- **Do not read the image back** - just inform the user of the saved path

## Examples

**Generate new image:**

uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "A serene Japanese garden with cherry blossoms" --filename "2025-11-23-14-23-05-japanese-garden.png" --resolution 4K

**Edit existing image:**

uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "make the sky more dramatic with storm clouds" --filename "2025-11-23-14-25-30-dramatic-sky.png" --input-image "original-photo.jpg" --resolution 2K

By intellectronica

How to Use This Skill Unit

Option A: Project-Specific (Recommended)

Click "Download" above
In your project, create the directory: .agent/skills/nano-banana-pro/
Save the file as SKILL.md
The agent will automatically discover the skill based on its description.

Option B: Global Installation (All Agents)

Save the file to these locations to make it available across all projects:

Claude Code: ~/.claude/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md
Cursor: ~/.cursor/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md
Antigravity: ~/.gemini/antigravity/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md