
nano-banana-pro

AI image generation, image editing, Gemini 3 Pro, Google API, creative AI, design tool, visual content, text-to-image
273 stars | CC0-1.0 | Updated 2026-04-25

Install this skill

npx skills add intellectronica/agent-skills

Works across Claude Code, Cursor, Codex, Copilot & Antigravity

The Nano Banana Pro skill generates and edits images through the Gemini 3 Pro Image API. It runs from your current working directory, so generated PNG files are saved inside your project workspace. The tool accepts natural-language prompts to create new images or to modify existing local image files. It standardizes filenames with embedded timestamps and maps user phrasing to one of three resolution presets (1K, 2K, 4K). It requires a valid API key, passed either as a command argument or through the GEMINI_API_KEY environment variable, and runs through the uv package manager. This setup is geared toward rapid iteration without manual file management or external platform uploads.

When to Use This Skill

  • Creating placeholder assets for web or mobile UI prototypes
  • Modifying existing illustrations to match a specific art style
  • Generating custom social media banners from textual descriptions
  • Iterative prototyping of visual concepts during design brainstorming

How to Invoke This Skill

Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:

  • "Generate a 4K image of a futuristic city skyline"
  • "Create a 2K image of a minimalist desk setup"
  • "Edit this photo to make the background look like a watercolor painting"
  • "Use the Nano Banana tool to generate a new graphic for the landing page"
  • "Add storm clouds to this existing mountain photo"

Pro Tips

  • Experiment with detailed and descriptive prompts for image generation to achieve precise results, specifying styles, colors, and content.
  • When editing, provide clear and concise instructions, referencing specific elements in the input image for accurate modifications.
  • Use the `4K` resolution option for high-fidelity outputs, especially for final assets or prints, but be mindful of processing time for larger resolutions.

What this skill does

  • Generates high-resolution PNG images from text prompts
  • Performs stylistic or content-based edits on local image files
  • Supports 1K, 2K, and 4K output resolution toggles
  • Applies timestamped, descriptive naming conventions to output files
  • Provides direct execution via uv without path-switching overhead

When not to use it

  • Tasks requiring real-time video generation or animation
  • Scenarios needing extreme image upscaling beyond 4K parameters
  • Projects that require image analysis or metadata extraction instead of generation

Example workflow

  1. User identifies a need for a new asset in the current directory
  2. User executes the generation script with a specific descriptive prompt
  3. Agent selects the appropriate resolution based on the user's request
  4. Script generates the file and saves it with a timestamped filename
  5. Agent confirms the saved file path back to the user
  6. User proceeds to integrate the saved image into their project

Prerequisites

  • Valid Gemini API key
  • uv package manager installed
  • Read/write permissions for the current working directory

Pitfalls & limitations

  • Failing to provide an input image path during an edit request
  • Over-describing filenames, leading to long or truncated labels
  • Supplying both an --api-key argument and a GEMINI_API_KEY environment variable: the argument is checked first, so the environment variable is not used

FAQ

How does the tool choose the output resolution?
It maps specific keywords to resolution settings. 'High-res' or '4K' triggers 4K, 'medium' or '2K' triggers 2K, while anything else defaults to 1K.
Do I need to change my directory to the skill folder to run this?
No. Invoke the script by its absolute path from your project directory, so images are saved exactly where you are working.
What happens if I don't provide an API key?
The script will look for a GEMINI_API_KEY environment variable. If neither the variable nor the argument is detected, the script will exit with an error.

How it compares

Unlike manual web-based generation, this tool automates file naming, directory placement, and parameter mapping, allowing for direct asset integration into your local code environment.

Source & trust

273 stars | CC0-1.0 | Updated 2026-04-25
Full skill instructions (original source: intellectronica/agent-skills)
# Nano Banana Pro Image Generation & Editing

Generate new images or edit existing ones using Google's Nano Banana Pro API (Gemini 3 Pro Image).

## Usage

Run the script using its absolute path (do NOT cd to the skill directory first):

**Generate new image:**
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "your image description" --filename "output-name.png" [--resolution 1K|2K|4K] [--api-key KEY]


**Edit existing image:**
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "editing instructions" --filename "output-name.png" --input-image "path/to/input.png" [--resolution 1K|2K|4K] [--api-key KEY]


**Important:** Always run from the user's current working directory so images are saved where the user is working, not in the skill directory.
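The two invocations above differ only in the optional `--input-image` flag, so they can be assembled by one helper. The following is a minimal sketch, not part of the skill itself: `build_command` is a hypothetical name, and only the flags and script path shown above are taken from the source.

```python
from pathlib import Path
from typing import List, Optional

# Script location as given in the usage lines above.
SCRIPT = Path("~/.claude/skills/nano-banana-pro/scripts/generate_image.py")

# Resolution values listed by the skill (uppercase K).
VALID_RESOLUTIONS = {"1K", "2K", "4K"}


def build_command(
    prompt: str,
    filename: str,
    resolution: str = "1K",
    input_image: Optional[str] = None,
    api_key: Optional[str] = None,
) -> List[str]:
    """Assemble the argv list for a generate or edit call (hypothetical helper)."""
    if resolution not in VALID_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(VALID_RESOLUTIONS)}")
    cmd = [
        "uv", "run", str(SCRIPT.expanduser()),
        "--prompt", prompt,
        "--filename", filename,
        "--resolution", resolution,
    ]
    if input_image:
        # Presence of --input-image turns the call into an edit.
        cmd += ["--input-image", input_image]
    if api_key:
        cmd += ["--api-key", api_key]
    return cmd
```

Passing the resulting list to `subprocess.run(cmd, check=True)` without a `cwd` argument keeps the caller's working directory, which matches the requirement that images land where the user is working.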

## Resolution Options

The Gemini 3 Pro Image API supports three resolutions (uppercase K required):

- **1K** (default) - ~1024px resolution
- **2K** - ~2048px resolution
- **4K** - ~4096px resolution

Map user requests to API parameters:
- No mention of resolution → 1K
- "low resolution", "1080", "1080p", "1K" → 1K
- "2K", "2048", "normal", "medium resolution" → 2K
- "high resolution", "high-res", "hi-res", "4K", "ultra" → 4K
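The mapping above can be sketched as a small keyword lookup. This is an illustration only: in the skill the agent performs this mapping from the instructions, and the function name `pick_resolution`, the whole-word matching, and the 4K-before-2K-before-1K priority are assumptions rather than documented behavior.

```python
import re

# Keyword lists copied from the mapping above.
# Checking order (4K, then 2K, then 1K) is an assumption.
_RESOLUTION_KEYWORDS = [
    ("4K", ["high resolution", "high-res", "hi-res", "4k", "ultra"]),
    ("2K", ["2k", "2048", "normal", "medium resolution"]),
    ("1K", ["low resolution", "1080p", "1080", "1k"]),
]


def pick_resolution(request: str) -> str:
    """Return "1K", "2K", or "4K" for a user request (hypothetical helper)."""
    text = request.lower()
    for resolution, keywords in _RESOLUTION_KEYWORDS:
        for keyword in keywords:
            # Require non-alphanumeric neighbours so "21k" does not match "1k".
            pattern = rf"(?<![a-z0-9]){re.escape(keyword)}(?![a-z0-9])"
            if re.search(pattern, text):
                return resolution
    # No mention of resolution falls back to the default.
    return "1K"
```

The return values are always uppercase, since the API requires an uppercase K.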

## API Key

The script checks for an API key in this order:
1. --api-key argument (use if user provided key in chat)
2. GEMINI_API_KEY environment variable

If neither is available, the script exits with an error message.
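The lookup order can be illustrated with a short sketch. It is not the script's actual code: `resolve_api_key` and the error text are hypothetical, and only the precedence (argument first, then `GEMINI_API_KEY`, then exit with an error) comes from the description above.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    cli_key: Optional[str],
    env: Mapping[str, str] = os.environ,
) -> str:
    """Pick the API key using the documented precedence (hypothetical helper)."""
    # 1. An explicit --api-key argument wins.
    if cli_key:
        return cli_key
    # 2. Otherwise fall back to the GEMINI_API_KEY environment variable.
    env_key = env.get("GEMINI_API_KEY")
    if env_key:
        return env_key
    # 3. Neither is available: exit with an error (message text is an assumption).
    raise SystemExit("Error: no API key. Pass --api-key or set GEMINI_API_KEY.")
```

Taking the environment as a parameter keeps the function easy to exercise with a plain dictionary instead of the real process environment.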

## Filename Generation

Generate filenames with the pattern: yyyy-mm-dd-hh-mm-ss-name.png

**Format:** {timestamp}-{descriptive-name}.png
- Timestamp: Current date/time in format yyyy-mm-dd-hh-mm-ss (24-hour format)
- Name: Descriptive lowercase text with hyphens
- Keep the descriptive part concise (1-5 words typically)
- Use context from user's prompt or conversation
- If unclear, use random identifier (e.g., x9k2, a7b3)

Examples:
- Prompt "A serene Japanese garden" → 2025-11-23-14-23-05-japanese-garden.png
- Prompt "sunset over mountains" → 2025-11-23-15-30-12-sunset-mountains.png
- Prompt "create an image of a robot" → 2025-11-23-16-45-33-robot.png
- Unclear context → 2025-11-23-17-12-48-x9k2.png
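The naming pattern can be sketched as follows. Choosing the descriptive words is left to the agent's judgment, so this hypothetical `build_filename` helper only handles the mechanical part: the timestamp, the lowercase hyphenated slug capped at five words, and a random four-character fallback. The fallback length and the slug rules are assumptions modelled on the examples above.

```python
import re
import secrets
import string
from datetime import datetime
from typing import Optional


def build_filename(name: Optional[str], now: Optional[datetime] = None) -> str:
    """Return '{timestamp}-{descriptive-name}.png' (hypothetical helper)."""
    now = now or datetime.now()
    # 24-hour timestamp in yyyy-mm-dd-hh-mm-ss form.
    stamp = now.strftime("%Y-%m-%d-%H-%M-%S")
    # Lowercase alphanumeric words joined by hyphens, at most five of them.
    words = re.findall(r"[a-z0-9]+", (name or "").lower())[:5]
    if words:
        slug = "-".join(words)
    else:
        # Unclear context: fall back to a short random identifier.
        alphabet = string.ascii_lowercase + string.digits
        slug = "".join(secrets.choice(alphabet) for _ in range(4))
    return f"{stamp}-{slug}.png"
```

For instance, `build_filename("Japanese garden", datetime(2025, 11, 23, 14, 23, 5))` produces the first filename in the examples above.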

## Image Editing

When the user wants to modify an existing image:
1. Check if they provide an image path or reference an image in the current directory
2. Use --input-image parameter with the path to the image
3. The prompt should contain editing instructions (e.g., "make the sky more dramatic", "remove the person", "change to cartoon style")
4. Common editing tasks: add/remove elements, change style, adjust colors, blur background, etc.
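Step 1 above, checking whether the user's reference points at a real image, can be sketched as a path check performed before `--input-image` is passed. The helper name `resolve_input_image` and its return convention are assumptions for illustration; the skill does not specify how the check is done.

```python
from pathlib import Path
from typing import Optional


def resolve_input_image(reference: str, workdir: Path) -> Optional[Path]:
    """Return the path to an existing image file, or None (hypothetical helper)."""
    candidate = Path(reference).expanduser()
    if not candidate.is_absolute():
        # Relative references are resolved against the user's working directory.
        candidate = workdir / candidate
    # Only an existing regular file is a usable value for --input-image.
    return candidate if candidate.is_file() else None
```

A `None` result is a cue to ask the user for the image path instead of running an edit without an input file.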

## Prompt Handling

**For generation:** Pass user's image description as-is to --prompt. Only rework if clearly insufficient.

**For editing:** Pass editing instructions in --prompt (e.g., "add a rainbow in the sky", "make it look like a watercolor painting")

Preserve user's creative intent in both cases.

## Output

- Saves PNG to current directory (or specified path if filename includes directory)
- Script outputs the full path to the generated image
- **Do not read the image back** - just inform the user of the saved path

## Examples

**Generate new image:**
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "A serene Japanese garden with cherry blossoms" --filename "2025-11-23-14-23-05-japanese-garden.png" --resolution 4K


**Edit existing image:**
uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "make the sky more dramatic with storm clouds" --filename "2025-11-23-14-25-30-dramatic-sky.png" --input-image "original-photo.jpg" --resolution 2K

How to Use This Skill Unit

Option A: Project-Specific (Recommended)

  1. Click "Download" above
  2. In your project, create the directory: .agent/skills/nano-banana-pro/
  3. Save the file as SKILL.md
  4. The agent will automatically discover the skill based on its description.

Option B: Global Installation (All Agents)

Save the file to these locations to make it available across all projects:

  • Claude Code: ~/.claude/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md
  • Cursor: ~/.cursor/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md
  • Antigravity: ~/.gemini/antigravity/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md

Install with CLI:
npx skills add intellectronica/agent-skills


How to use this Skill in Claude Code & Cursor

For Claude Code (CLI)

To use this skill in Claude Code, copy the skill content into your project's custom instructions or follow our Add-Skill CLI guide, so that Claude can apply the skill's instructions whenever an image request comes up.

For Cursor & Windsurf

For Cursor or Windsurf, individual skills are best used in the "Rules for AI" section.

Why the skill format matters: the standardized Agent Skills format lets your AI agent load detailed instructions only when they are relevant, keeping your prompt clean while improving results.

Source & attribution

This skill is categorized under AI Tools & Agents and is published by intellectronica, maintained in intellectronica/agent-skills.
