webapp-testing

Name: webapp-testing
Author: Anthropic

PlaywrightUI TestingFrontendDebuggingWeb ApplicationsAutomationBrowser AutomationPython

⭐ 151.5k🕒 2026-06-09Source ↗

Install this skill

npx skills add anthropics/skills

Works across Claude Code, Cursor, Codex, Copilot & Antigravity

The webapp-testing skill provides a standardized framework for automating browser-based interactions using Python and Playwright. It is optimized for local environments where manual browser inspection is inefficient. By using the included server management utility, scripts handle complex dependency lifecycles automatically, preventing common race conditions associated with dynamic client-side rendering. The workflow prioritizes a 'reconnaissance-then-action' pattern, requiring state verification before DOM interaction. This ensures stability in automated test suites by preventing premature element selection before the application reaches a networkidle state. Rather than manual inspection, the tool encourages headless execution, saving context window space by treating provided helper scripts as black-box utilities. Users should invoke --help on existing scripts before writing custom logic, ensuring maintenance overhead remains low while consistency is maintained across automated browser tasks.

When to Use This Skill

•Validating frontend functionality against local backend APIs
•Automating navigation and form entry in dynamic single-page applications
•Debugging UI state inconsistencies via captured console logs
•Batch testing static HTML files without manual browser loading

How to Invoke This Skill

Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:

“test my local web application
“start my frontend and backend servers for testing
“automate interactions on my local dev site
“how do I script Playwright in this environment
“inspect elements on my dynamic web app automatically

Pro Tips

💡Always run helper scripts like `with_server.py` with `--help` first to understand their parameters without consuming context.
💡For complex dynamic applications, start with reconnaissance steps: navigate, wait for `networkidle`, then inspect the DOM or take screenshots to identify elements.
💡Prioritize using the provided helper scripts as black-box tools rather than reading their source code to conserve context window space.

What this skill does

•Manage multi-server lifecycles through standardized scripts
•Perform state-aware DOM inspection using headless Chromium
•Execute synchronous Playwright automation via headless browser instances
•Capture screenshots and log console output for debugging
•Identify selectors via rendered page content analysis

When not to use it

✕Testing external production sites where you do not control the server
✕High-load performance testing where standard Playwright timing may be insufficient

Example workflow

Run python scripts/with_server.py --help to identify valid server configurations
Execute the server helper with your dev commands and targeted ports
Navigate to the local address using a sync_playwright browser context
Wait for networkidle to ensure the JS application has finished loading
Capture a screenshot or query DOM elements to verify expected state
Close the browser context and terminate the server process

Prerequisites

–Python 3.x
–Playwright installed in the environment
–Local access to the application source code

Pitfalls & limitations

!Attempting to query DOM elements before networkidle state
!Over-populating the context window by reading large helper scripts instead of calling them
!Failing to close browser sessions leading to orphaned processes

FAQ

Should I manually read the helper scripts before running them?

No, treat them as black-box utilities. Use --help to understand the interface and run them directly to avoid cluttering your context window.

What happens if I try to inspect the DOM immediately after page.goto?

You will likely fail because dynamic apps require time to execute JavaScript. Always wait for networkidle first.

Can I test two servers at once?

Yes, the script supports multiple --server flags to manage complex environments like decoupled backend and frontend architectures.

Do I need to launch the browser in visible mode?

No, the recommended practice is to use headless mode for automated tasks to maintain stability and performance.

How it compares

Unlike generic browser automation prompts that often hallucinate selector timing, this skill enforces a strict state-verification pattern and provides server-management utilities that remove manual process orchestration.

Source & trust

⭐ 152k stars🕒 Updated 2026-06-09

View original skill on GitHub →

📄 Full skill instructions — original source: anthropics/skills

# Web Application Testing

To test local web applications, write native Python Playwright scripts.

**Helper Scripts Available**:
- scripts/with_server.py - Manages server lifecycle (supports multiple servers)

**Always run scripts with --help first** to see usage. DO NOT read the source until you try running the script first and find that a customized solution is abslutely necessary. These scripts can be very large and thus pollute your context window. They exist to be called directly as black-box scripts rather than ingested into your context window.

## Decision Tree: Choosing Your Approach

User task → Is it static HTML?
    ├─ Yes → Read HTML file directly to identify selectors
    │         ├─ Success → Write Playwright script using selectors
    │         └─ Fails/Incomplete → Treat as dynamic (below)
    │
    └─ No (dynamic webapp) → Is the server already running?
        ├─ No → Run: python scripts/with_server.py --help
        │        Then use the helper + write simplified Playwright script
        │
        └─ Yes → Reconnaissance-then-action:
            1. Navigate and wait for networkidle
            2. Take screenshot or inspect DOM
            3. Identify selectors from rendered state
            4. Execute actions with discovered selectors

## Example: Using with_server.py

To start a server, run --help first, then use the helper:

**Single server:**

python scripts/with_server.py --server "npm run dev" --port 5173 -- python your_automation.py

**Multiple servers (e.g., backend + frontend):**

python scripts/with_server.py \
  --server "cd backend && python server.py" --port 3000 \
  --server "cd frontend && npm run dev" --port 5173 \
  -- python your_automation.py

To create an automation script, include only Playwright logic (servers are managed automatically):

from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    browser = p.chromium.launch(headless=True) # Always launch chromium in headless mode
    page = browser.new_page()
    page.goto('http://localhost:5173') # Server already running and ready
    page.wait_for_load_state('networkidle') # CRITICAL: Wait for JS to execute
    # ... your automation logic
    browser.close()

## Reconnaissance-Then-Action Pattern

1. **Inspect rendered DOM**:

page.screenshot(path='/tmp/inspect.png', full_page=True)
   content = page.content()
   page.locator('button').all()

2. **Identify selectors** from inspection results

3. **Execute actions** using discovered selectors

## Common Pitfall

❌ **Don't** inspect the DOM before waiting for networkidle on dynamic apps
✅ **Do** wait for page.wait_for_load_state('networkidle') before inspection

## Best Practices

- **Use bundled scripts as black boxes** - To accomplish a task, consider whether one of the scripts available in scripts/ can help. These scripts handle common, complex workflows reliably without cluttering the context window. Use --help to see usage, then invoke directly.
- Use sync_playwright() for synchronous scripts
- Always close the browser when done
- Use descriptive selectors: text=, role=, CSS selectors, or IDs
- Add appropriate waits: page.wait_for_selector() or page.wait_for_timeout()

## Reference Files

- **examples/** - Examples showing common patterns:
- element_discovery.py - Discovering buttons, links, and inputs on a page
- static_html_automation.py - Using file:// URLs for local HTML
- console_logging.py - Capturing console logs during automation

By Anthropic

How to Use This Skill Unit

Option A: Project-Specific (Recommended)

Click "Download" above
In your project, create the directory: .agent/skills/webapp-testing/
Save the file as SKILL.md
The agent will automatically discover the skill based on its description.

Option B: Global Installation (All Agents)

Save the file to these locations to make it available across all projects:

Claude Code: ~/.claude/skills/anthropics/skills/webapp-testing/SKILL.md
Cursor: ~/.cursor/skills/anthropics/skills/webapp-testing/SKILL.md
Antigravity: ~/.gemini/antigravity/skills/anthropics/skills/webapp-testing/SKILL.md

🚀 Install with CLI:
npx skills add anthropics/skills

Read the Master Guide: Mastering Agent Skills →

Recommended Rules

View more rules →

Recommended Workflows

View more workflows →

Debugging Infinite Re-renders

ReactDebuggingPerformance

--- description: Track down and fix infinite loops in useEffect and component rendering --- 1. **Check `useEffect` Dependencies**: - The most comm...

E2E Testing Setup (Playwright)

TestingE2EPlaywright

--- description: Boilerplate setup for end-to-end testing --- 1. **Initialize Playwright**: - Run the init command. // turbo - Run `npm init...

Automatic commit message generator

GitAIAutomation

--- description: Automatic commit message generator and fast AI-powered commit for all current changes --- // turbo-all This workflow automatically ...

Recommended MCP Servers

View more MCP servers →

Playwright

Official

— Browser automation MCP server using Playwright to run tests, navigate pages, capture screenshots, scrape content, and automate web interactions reliably.

Playwright

Community

This MCP Server will help you run browser automation and webscraping using Playwright

Hyperbrowser

Official

[Hyperbrowser](https://www.hyperbrowser.ai/) is the next-generation platform empowering AI agents and enabling effortless, scalable browser automation.

Take It Further

Maximize your productivity with these powerful resources

📋

Define Your Standards

Set up coding standards to ensure this workflow produces consistent, high-quality results.

Browse Rules Library

📖

Master Workflows

Learn how to create custom workflows, use Turbo Mode, and build your automation library.

Complete Guide

webapp-testing

Install this skill

When to Use This Skill

How to Invoke This Skill

Pro Tips

What this skill does

When not to use it

Example workflow

Prerequisites

Pitfalls & limitations

FAQ

How it compares

Source & trust

How to Use This Skill Unit

Option A: Project-Specific (Recommended)

Option B: Global Installation (All Agents)

Recommended Rules

Python Automation & Scripting Expert

Python DevOps & Infrastructure Automation

Python Web Scraping Expert

Recommended Workflows

Debugging Infinite Re-renders

E2E Testing Setup (Playwright)

Automatic commit message generator

Recommended MCP Servers

Playwright

Playwright

Hyperbrowser

Take It Further

Define Your Standards

Master Workflows

How to use this Skill in Claude Code & Cursor

For Claude Code (CLI)

For Cursor & Windsurf

Source & attribution