Back to Creative & Visual

gpt-image-1-5

AIimage generationimage editingGPT-1.5OpenAIcreative AIdesignvisuals
273📄 CC0-1.0🕒 2026-04-25Source ↗

Install this skill

npx skills add intellectronica/agent-skills

Works across Claude Code, Cursor, Codex, Copilot & Antigravity

GPT Image 1.5 provides a CLI-based interface for generating and modifying visual assets using OpenAI models. This tool integrates directly into the local development environment, outputting files based on textual prompts. It supports standard image creation, full-canvas transformations, and pixel-precise inpainting via mask files. Users maintain control over image quality, dimensions, and background transparency by passing specific arguments. When performing edits, the script handles either entire-image style changes or specific area modifications using provided masks. By automating file naming with timestamps and naming conventions, the tool maintains an organized asset library in the current directory. It is intended for rapid prototyping, asset iteration, and creative tasks within a terminal-centric workflow, ensuring results are saved locally without needing a web browser.

When to Use This Skill

  • Creating custom UI icons or background graphics for software projects
  • Rapidly iterating on design mockups by applying style changes to existing prototypes
  • Inpainting specific regions of an image to remove or replace unwanted elements
  • Generating assets with alpha channels for web or app design

How to Invoke This Skill

Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:

  • Create a high-res image of a futuristic city skyline
  • Generate a square icon for a coffee shop app
  • Change the background of this image to a beach
  • Add a mountain range to the background of my landscape photo
  • Make this image look like an oil painting
  • Create a tall vertical draft of a forest

Pro Tips

  • 💡Utilize the `--mask` parameter for highly targeted edits, ensuring only the specified area is modified for precise control over your image manipulations.
  • 💡Experiment with the `--quality` and `--size` parameters to balance output fidelity with generation time, especially for prototyping or final asset creation.
  • 💡When editing, ensure your `--prompt` clearly describes the desired changes or what should appear in the masked area for optimal results.

What this skill does

  • Generate original imagery from descriptive text prompts
  • Perform full-image edits to alter styles or compositions
  • Conduct localized inpainting using transparent PNG masks
  • Customize output size between square, portrait, and landscape formats
  • Toggle between low, medium, and high quality rendering settings
  • Control background transparency for generated assets

When not to use it

  • High-fidelity professional print design requiring specific color profiles or vector exports
  • Mass-batch image processing where significant latency or API cost management is needed
  • Tasks requiring manual brush-based fine-tuning or vector path manipulation

Example workflow

  1. User executes generate_image.py to create a base image of a workspace
  2. Script saves the file to the current working directory with a timestamp
  3. User creates a mask image in a photo editor highlighting the computer screen area
  4. User executes the script again with --input-image and --mask paths
  5. Script replaces the screen content while preserving the surrounding workspace
  6. User confirms the file path returned by the script output

Prerequisites

  • uv environment installed
  • OpenAI API key available via CLI flag or environment variable
  • Base image file for editing workflows
  • PNG mask file for inpainting

Pitfalls & limitations

  • !Running the script inside the skill directory instead of the project directory may clutter internal folders
  • !Large high-quality renders can increase API latency and execution time
  • !Masks must be correctly aligned with the input image dimensions for accurate inpainting

FAQ

How do I specify a portrait orientation?
Include the --size 1024x1536 argument in your command.
Can I remove the background from an image?
Yes, use the --background transparent argument during the generation process.
Where should I run the command from?
Always run the command from your current project directory to ensure the output files are saved in the desired location.
How does the tool choose the output filename?
It uses a timestamp prefix combined with a slugified version of your prompt to ensure unique, readable file names.

How it compares

Unlike manual web-based image tools, this skill allows for programmatic integration and file management within a terminal workflow without ever leaving your IDE.

Source & trust

273 stars📄 CC0-1.0🕒 Updated 2026-04-25
📄 Full skill instructions — original source: intellectronica/agent-skills
# GPT Image 1.5 - Image Generation & Editing

Generate new images or edit existing ones using OpenAI's GPT Image 1.5 model.

- **Generation**: Uses the Responses API with image_generation tool
- **Editing**: Uses the Image API for reliable mask-based inpainting

## Usage

Run the script using absolute path (do NOT cd to skill directory first):

**Generate new image:**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "your image description" --filename "output-name.png" [--quality low|medium|high] [--size 1024x1024|1024x1536|1536x1024|auto] [--background transparent|opaque|auto] [--api-key KEY]


**Edit existing image (without mask - full image edit):**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "editing instructions" --filename "output-name.png" --input-image "path/to/input.png" [--size 1024x1024|1024x1536|1536x1024|auto] [--api-key KEY]


**Edit existing image (with mask - precise inpainting):**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "what to put in masked area" --filename "output-name.png" --input-image "path/to/input.png" --mask "path/to/mask.png" [--size 1024x1024|1024x1536|1536x1024|auto] [--api-key KEY]


**Important:** Always run from the user's current working directory so images are saved where the user is working, not in the skill directory.

## Parameters

### Quality Options
- **low** - Fastest generation, lower quality
- **medium** (default) - Balanced quality and speed
- **high** - Best quality, slower generation

Map user requests:
- No mention of quality -> medium
- "quick", "fast", "draft" -> low
- "high quality", "best", "detailed", "high-res" -> high

### Size Options
- **1024x1024** (default) - Square format
- **1024x1536** - Portrait format
- **1536x1024** - Landscape format
- **auto** - Let the model decide based on prompt

Map user requests:
- No mention of size -> 1024x1024
- "square" -> 1024x1024
- "portrait", "vertical", "tall" -> 1024x1536
- "landscape", "horizontal", "wide" -> 1536x1024

### Background Options (generation only)
- **auto** (default) - Model decides
- **transparent** - Transparent background (PNG/WebP output)
- **opaque** - Solid background

## API Key

The script checks for API key in this order:
1. --api-key argument (use if user provided key in chat)
2. OPENAI_API_KEY environment variable

If neither is available, the script exits with an error message.

## Filename Generation

Generate filenames with the pattern: yyyy-mm-dd-hh-mm-ss-name.png

**Format:** {timestamp}-{descriptive-name}.png
- Timestamp: Current date/time in format yyyy-mm-dd-hh-mm-ss (24-hour format)
- Name: Descriptive lowercase text with hyphens
- Keep the descriptive part concise (1-5 words typically)
- Use context from user's prompt or conversation
- If unclear, use random identifier (e.g., x9k2, a7b3)

Examples:
- Prompt "A serene Japanese garden" -> 2025-12-17-14-23-05-japanese-garden.png
- Prompt "sunset over mountains" -> 2025-12-17-15-30-12-sunset-mountains.png
- Prompt "create an image of a robot" -> 2025-12-17-16-45-33-robot.png
- Unclear context -> 2025-12-17-17-12-48-x9k2.png

## Image Editing

Both editing modes use the Image API (images.edit endpoint) with gpt-image-1.5 for reliable results.

### Without Mask (Full Image Edit)
When the user wants to modify an existing image without specifying exact regions:
1. Use --input-image parameter with the path to the image
2. The prompt should contain editing instructions (e.g., "make the sky more dramatic", "change to cartoon style")
3. A fully transparent mask is auto-generated, allowing the model to edit the entire image

### With Mask (Precise Inpainting)
When the user wants to edit specific regions:
1. Use --input-image parameter with the path to the image
2. Use --mask parameter with a PNG mask file
3. The mask should have transparent areas (alpha=0) where edits should occur
4. The prompt describes what should appear in the masked region

Common editing tasks: add/remove elements, change style, adjust colors, replace backgrounds, etc.

## Prompt Handling

**For generation:** Pass user's image description as-is to --prompt. Only rework if clearly insufficient.

**For editing:** Pass editing instructions in --prompt (e.g., "add a rainbow in the sky", "make it look like a watercolor painting")

Preserve user's creative intent in both cases.

## Output

- Saves PNG to current directory (or specified path if filename includes directory)
- Script outputs the full path to the generated image
- **Do not read the image back** - just inform the user of the saved path

## Examples

**Generate new image:**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "A serene Japanese garden with cherry blossoms" --filename "2025-12-17-14-23-05-japanese-garden.png" --quality high --size 1536x1024


**Generate with transparent background:**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "A cute cartoon cat mascot" --filename "2025-12-17-14-25-30-cat-mascot.png" --background transparent --quality high


**Edit existing image (full image):**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "make the sky more dramatic with storm clouds" --filename "2025-12-17-14-27-00-dramatic-sky.png" --input-image "original-photo.jpg"


**Edit with mask (inpainting):**
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "a flamingo swimming" --filename "2025-12-17-14-30-00-lounge-flamingo.png" --input-image "lounge.png" --mask "mask.png"

How to Use This Skill Unit

Option A: Project-Specific (Recommended)

  1. Click "Download" above
  2. In your project, create the directory: .agent/skills/gpt-image-1-5/
  3. Save the file as SKILL.md
  4. The agent will automatically discover the skill based on its description.

Option B: Global Installation (All Agents)

Save the file to these locations to make it available across all projects:

  • Claude Code: ~/.claude/skills/intellectronica/agent-skills/gpt-image-1-5/SKILL.md
  • Cursor: ~/.cursor/skills/intellectronica/agent-skills/gpt-image-1-5/SKILL.md
  • Antigravity: ~/.gemini/antigravity/skills/intellectronica/agent-skills/gpt-image-1-5/SKILL.md

🚀 Install with CLI:
npx skills add intellectronica/agent-skills

Read the Master Guide: Mastering Agent Skills

Recommended Rules

View more rules

Recommended Workflows

View more workflows

Recommended MCP Servers

View more MCP servers

Take It Further

Maximize your productivity with these powerful resources

📋

Define Your Standards

Set up coding standards to ensure this workflow produces consistent, high-quality results.

Browse Rules Library
📖

Master Workflows

Learn how to create custom workflows, use Turbo Mode, and build your automation library.

Complete Guide

How to use this Skill in Claude Code & Cursor

For Claude Code (CLI)

To use this skill in Claude Code, copy the rule content into your project's custom instructions or follow our Add-Skill CLI guide. This ensures Claude follows your standards during every code generation.

For Cursor & Windsurf

For Cursor or Windsurf, individual skills are best used in the "Rules for AI" section. This specific unit helps the agent avoid creative & visual issues, leading to cleaner, more efficient code.

Why the skill format matters: the standardized Agent Skills format lets your AI agent load detailed instructions only when they are relevant, keeping your prompt clean while improving results.

Source & attribution

This skill is categorized under Creative & Visual and is published by intellectronica, maintained in intellectronica/agent-skills.

← Browse All Agent Skills
Sponsored AI assistant. Recommendations may be paid.