nano-banana-pro
Install this skill
npx skills add intellectronica/agent-skills
Works across Claude Code, Cursor, Codex, Copilot & Antigravity
The Nano Banana Pro skill generates and edits images through the Gemini 3 Pro Image API. It runs from your current working directory, so generated PNG files are saved inside your project workspace. It accepts natural-language prompts to create new images from scratch or to modify existing local image files. It standardizes filenames with embedded timestamps and maps user wording to resolution presets (1K, 2K, 4K). The script requires a valid Gemini API key, either passed as a command-line argument or set in the GEMINI_API_KEY environment variable, and is executed with the uv package manager. This setup supports rapid iteration without manual file management or external platform uploads.
When to Use This Skill
- Creating placeholder assets for web or mobile UI prototypes
- Modifying existing illustrations to match a specific art style
- Generating custom social media banners from textual descriptions
- Iterative prototyping of visual concepts during design brainstorming
How to Invoke This Skill
Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:
- "Generate a 4K image of a futuristic city skyline"
- "Create a 2K image of a minimalist desk setup"
- "Edit this photo to make the background look like a watercolor painting"
- "Use the Nano Banana tool to generate a new graphic for the landing page"
- "Add storm clouds to this existing mountain photo"
Pro Tips
- Experiment with detailed and descriptive prompts for image generation to achieve precise results, specifying styles, colors, and content.
- When editing, provide clear and concise instructions, referencing specific elements in the input image for accurate modifications.
- Utilize the `4K` resolution option for high-fidelity outputs, especially for final assets or prints, but be mindful of processing time for larger resolutions.
What this skill does
- Generates high-resolution PNG images from text prompts
- Performs stylistic or content-based edits on local image files
- Supports 1K, 2K, and 4K output resolution toggles
- Applies timestamped, descriptive naming conventions to output files
- Provides direct execution via uv without path-switching overhead
When not to use it
- Tasks requiring real-time video generation or animation
- Scenarios needing extreme image upscaling beyond 4K parameters
- Projects that require image analysis or metadata extraction instead of generation
Example workflow
- User identifies a need for a new asset in the current directory
- User executes the generation script with a specific descriptive prompt
- Agent selects the appropriate resolution based on the user's request
- Script generates the file and saves it with a timestamped filename
- Agent confirms the saved file path back to the user
- User proceeds to integrate the saved image into their project
Prerequisites
- Valid Gemini API key
- uv package manager installed
- Read/write permissions for the current working directory
Pitfalls & limitations
- Failing to provide an input image path during an edit request
- Over-describing filenames, leading to long or truncated labels
- Conflicting keys when both an `--api-key` argument and the `GEMINI_API_KEY` environment variable are set (the argument is checked first)
How it compares
Unlike manual web-based generation, this tool automates file naming, directory placement, and parameter mapping, allowing for direct asset integration into your local code environment.
Full skill instructions (original source: intellectronica/agent-skills)
Generate new images or edit existing ones using Google's Nano Banana Pro API (Gemini 3 Pro Image).
## Usage
Run the script using its absolute path (do NOT cd to the skill directory first):
**Generate new image:**
`uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "your image description" --filename "output-name.png" [--resolution 1K|2K|4K] [--api-key KEY]`

**Edit existing image:**
`uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "editing instructions" --filename "output-name.png" --input-image "path/to/input.png" [--resolution 1K|2K|4K] [--api-key KEY]`

**Important:** Always run from the user's current working directory so images are saved where the user is working, not in the skill directory.
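As an illustration of how the two invocations above differ, here is a minimal Python sketch that assembles the argument list for either case. The helper name `build_command` is hypothetical and not part of the skill. Only the script path and the flags shown in the usage lines come from the source.

```python
from pathlib import Path
from typing import List, Optional

# Absolute script path, as given in the usage lines above.
SCRIPT = Path("~/.claude/skills/nano-banana-pro/scripts/generate_image.py").expanduser()

VALID_RESOLUTIONS = {"1K", "2K", "4K"}


def build_command(
    prompt: str,
    filename: str,
    resolution: str = "1K",
    input_image: Optional[str] = None,
    api_key: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: build the argv list for a generate or edit call."""
    if resolution not in VALID_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(VALID_RESOLUTIONS)}")
    cmd = [
        "uv", "run", str(SCRIPT),
        "--prompt", prompt,
        "--filename", filename,
        "--resolution", resolution,
    ]
    # Passing --input-image turns the call into an edit of an existing file.
    if input_image is not None:
        cmd += ["--input-image", input_image]
    # Only pass a key explicitly when the user supplied one.
    if api_key is not None:
        cmd += ["--api-key", api_key]
    return cmd
```

Passing the resulting list to `subprocess.run(cmd, check=True)` without a `cwd` argument runs the script in the caller's current working directory, which matches the instruction not to change into the skill directory.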
## Resolution Options
The Gemini 3 Pro Image API supports three resolutions (uppercase K required):
- **1K** (default) - ~1024px resolution
- **2K** - ~2048px resolution
- **4K** - ~4096px resolution
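The mapping rules listed in this section (user wording to one of the three presets) could be sketched roughly as follows. The function name `pick_resolution`, the whole-word matching, and the choice to check the highest preset first are assumptions for illustration. The skill itself leaves this mapping to the agent.

```python
import re

# Phrases taken from the mapping list in this section, lowercased and with
# hyphens written as spaces so "hi-res" and "hi res" are treated alike.
# Checking 4K first is an assumption; the source does not define precedence.
_PRESETS = [
    ("4K", ("high resolution", "high res", "hi res", "4k", "ultra")),
    ("2K", ("2k", "2048", "normal", "medium resolution")),
    ("1K", ("low resolution", "1080", "1080p", "1k")),
]


def pick_resolution(request: str) -> str:
    """Hypothetical helper: map a user's request text to 1K, 2K, or 4K."""
    text = request.lower().replace("-", " ")
    for preset, phrases in _PRESETS:
        for phrase in phrases:
            # Match whole words only, so "4k" does not match inside "14km".
            if re.search(rf"(?<!\w){re.escape(phrase)}(?!\w)", text):
                return preset
    # No mention of resolution falls back to the default preset.
    return "1K"
```

Note that the returned value always uses an uppercase K, as the API requires.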
Map user requests to API parameters:
- No mention of resolution → `1K`
- "low resolution", "1080", "1080p", "1K" → `1K`
- "2K", "2048", "normal", "medium resolution" → `2K`
- "high resolution", "high-res", "hi-res", "4K", "ultra" → `4K`

## API Key
The script checks for an API key in this order:
1. `--api-key` argument (use if user provided key in chat)
2. `GEMINI_API_KEY` environment variable

If neither is available, the script exits with an error message.
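The lookup order described above can be sketched as follows. This is not the script's actual source. The function names and the error text are assumptions, and only the precedence (argument first, then `GEMINI_API_KEY`, then exit) comes from the description.

```python
import os
import sys
from typing import Mapping, Optional


def resolve_api_key(
    cli_key: Optional[str], env: Mapping[str, str] = os.environ
) -> Optional[str]:
    """Hypothetical helper: return the key to use, or None if none is set."""
    # 1. An explicit --api-key argument takes precedence.
    if cli_key:
        return cli_key
    # 2. Otherwise fall back to the GEMINI_API_KEY environment variable.
    return env.get("GEMINI_API_KEY") or None


def require_api_key(cli_key: Optional[str], env: Mapping[str, str] = os.environ) -> str:
    """Exit with an error message when neither source provides a key."""
    key = resolve_api_key(cli_key, env)
    if key is None:
        # The message text is illustrative, not the script's real output.
        sys.exit("Error: no API key. Pass --api-key or set GEMINI_API_KEY.")
    return key
```

Because the argument wins, a key passed on the command line silently overrides whatever is set in the environment.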
## Filename Generation
Generate filenames with the pattern:
`yyyy-mm-dd-hh-mm-ss-name.png`

**Format:** `{timestamp}-{descriptive-name}.png`
- Timestamp: Current date/time in format `yyyy-mm-dd-hh-mm-ss` (24-hour format)
- Name: Descriptive lowercase text with hyphens
- Keep the descriptive part concise (1-5 words typically)
- Use context from user's prompt or conversation
- If unclear, use random identifier (e.g., `x9k2`, `a7b3`)

Examples:
- Prompt "A serene Japanese garden" → `2025-11-23-14-23-05-japanese-garden.png`
- Prompt "sunset over mountains" → `2025-11-23-15-30-12-sunset-mountains.png`
- Prompt "create an image of a robot" → `2025-11-23-16-45-33-robot.png`
- Unclear context → `2025-11-23-17-12-48-x9k2.png`

## Image Editing
When the user wants to modify an existing image:
1. Check if they provide an image path or reference an image in the current directory
2. Use `--input-image` parameter with the path to the image
3. The prompt should contain editing instructions (e.g., "make the sky more dramatic", "remove the person", "change to cartoon style")
4. Common editing tasks: add/remove elements, change style, adjust colors, blur background, etc.
## Prompt Handling
**For generation:** Pass user's image description as-is to `--prompt`. Only rework if clearly insufficient.

**For editing:** Pass editing instructions in `--prompt` (e.g., "add a rainbow in the sky", "make it look like a watercolor painting")

Preserve user's creative intent in both cases.
## Output
- Saves PNG to current directory (or specified path if filename includes directory)
- Script outputs the full path to the generated image
- **Do not read the image back** - just inform the user of the saved path
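The saved path reported to the user depends on the name passed to `--filename`, which follows the pattern from the Filename Generation section. A minimal sketch of that naming rule is shown here. It assumes the agent has already condensed the prompt into a short label. The function name `make_filename` and the 4-character length of the random identifier are illustrative choices.

```python
import re
import secrets
import string
from datetime import datetime
from typing import Optional


def make_filename(label: Optional[str], now: Optional[datetime] = None) -> str:
    """Hypothetical helper: build '{timestamp}-{descriptive-name}.png'."""
    now = now or datetime.now()
    # Timestamp in yyyy-mm-dd-hh-mm-ss form, 24-hour clock.
    stamp = now.strftime("%Y-%m-%d-%H-%M-%S")
    # Lowercase words joined by hyphens, capped at five words.
    words = re.findall(r"[a-z0-9]+", (label or "").lower())[:5]
    if words:
        name = "-".join(words)
    else:
        # Unclear context: fall back to a short random identifier.
        alphabet = string.ascii_lowercase + string.digits
        name = "".join(secrets.choice(alphabet) for _ in range(4))
    return f"{stamp}-{name}.png"


# Example with a fixed timestamp:
# make_filename("Japanese garden", datetime(2025, 11, 23, 14, 23, 5))
# returns "2025-11-23-14-23-05-japanese-garden.png"
```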
## Examples
**Generate new image:**
`uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "A serene Japanese garden with cherry blossoms" --filename "2025-11-23-14-23-05-japanese-garden.png" --resolution 4K`

**Edit existing image:**
`uv run ~/.claude/skills/nano-banana-pro/scripts/generate_image.py --prompt "make the sky more dramatic with storm clouds" --filename "2025-11-23-14-25-30-dramatic-sky.png" --input-image "original-photo.jpg" --resolution 2K`

How to Use This Skill Unit
Option A: Project-Specific (Recommended)
- Click "Download" above
- In your project, create the directory: `.agent/skills/nano-banana-pro/`
- Save the file as `SKILL.md`
- The agent will automatically discover the skill based on its description.
Option B: Global Installation (All Agents)
Save the file to these locations to make it available across all projects:
- Claude Code: `~/.claude/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md`
- Cursor: `~/.cursor/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md`
- Antigravity: `~/.gemini/antigravity/skills/intellectronica/agent-skills/nano-banana-pro/SKILL.md`