How do I run a named, session-isolated browser?

Use -s= to create and reuse a named session (short, filesystem-safe name).

How are errors reported and handled?

Reliability rules state to treat any output containing ### Error as a failure, regardless of exit status.

What happens with screenshots?

Screenshots are captured as PNG, then compressed to JPEG with ffmpeg; the final artifact path is returned as a JPEG.

playwright-cli

npx machina-cli add skill IvanCampos/agents/playwright-cli --openclaw

Files (1)

SKILL.md

4.7 KB

Playwright CLI

Use playwright-cli directly (already installed) to execute browser automation from natural-language requests.

Quick Start

Verify the CLI is available:

command -v playwright-cli >/dev/null 2>&1

Open a browser session:

playwright-cli open https://example.com

Capture refs before ref-based commands:
```
playwright-cli snapshot
```

Natural-Language Execution Workflow

Parse the request into atomic actions.
Convert each action to command templates from references/command-map.md.
Run playwright-cli snapshot before commands that require element refs (click, fill, check, select, hover, drag, etc.) unless a fresh snapshot is already available.
Re-run playwright-cli snapshot after navigation or major DOM changes.
For assertions, prefer playwright-cli eval and parse the ### Result value from stdout.
Treat ### Error output as command failure, even when process exit code is 0.
For screenshots, capture PNG first, then compress to JPEG with ffmpeg before final save/return.
Use explicit filenames when saving artifacts (screenshots, snapshots, PDFs, videos, storage state).
Return executed commands and key results (URLs, counts, file paths, errors).

Command Mapping Rules

Map intent verbs directly:
- open/launch -> open
- navigate -> goto
- click/tap -> click
- double click/double tap -> dblclick
- type/input -> type
- fill/set field -> fill
- back/forward/refresh -> go-back/go-forward/reload
Preserve user-provided optional flags and values.
Omit optional args when unspecified.
Use -s=<name> when the user asks for a named/session-isolated browser.
Prefer native commands over eval and run-code; use eval or run-code only if no first-class command exists.
Keep session names short and filesystem-safe (lowercase letters + numbers, no punctuation, target <= 12 chars) to avoid local socket path errors on macOS.

Reliability Guardrails (Must Follow)

Do not classify pass/fail from shell exit codes for eval or run-code.
Always inspect command output:
- If output contains ### Error, treat as failure.
- If output contains ### Result, use that value for assertions.
Prefer eval for boolean checks and DOM assertions.
Use run-code for imperative Playwright flows only.
run-code expects a valid JS expression; statements like bare return ... or throw ... are invalid.
For JS snippets passed to eval, prefer expression-safe patterns:
- Good: Array.from(...).some(function(x){ ... })
- Risky in this CLI context: snippets that are parsed as non-expression wrappers.
When user requests pass and fail screenshots:
- Capture one screenshot per scenario after assertion.
- Capture to temporary PNG, then convert with ffmpeg to JPEG as the final artifact.
- Name final files with scenario status prefix, e.g. pass-<name>.jpg, fail-<name>.jpg.
- Save a machine-readable index file (results.txt or JSON) mapping scenario -> status -> final JPEG artifact path.

Screenshot Compression Rule (Required)

Always return compressed JPEG screenshots, not raw PNG screenshots.
Required flow:
1. Capture screenshot to temporary PNG path.
2. Compress to JPEG with ffmpeg.
3. Return/report only the JPEG path.

Default command pattern:

playwright-cli screenshot --full-page --filename "$TMP_PNG"
ffmpeg -y -i "$TMP_PNG" -q:v 4 "$FINAL_JPG"

If ffmpeg is unavailable, stop and report the missing dependency instead of returning an uncompressed PNG.

Session and Artifact Conventions

Use default session unless the user requests named sessions.
Keep multi-step flows in one session for continuity.
Place generated artifacts in predictable paths when the user does not specify:
- output/playwright/*.jpg
- output/playwright/*.pdf
- output/playwright/*.json
- output/playwright/*.zip

Full Command Coverage

Load references/command-map.md when mapping or validating command usage. It contains the complete command catalog for:

Core
Navigation
Keyboard
Mouse
Save as
Tabs
Storage (state, cookies, localStorage, sessionStorage)
Network
DevTools
Install
Configuration
Sessions

Source

git clone https://github.com/IvanCampos/agents/blob/main/skills/playwright-cli/SKILL.mdView on GitHub

Overview

playwright-cli translates natural-language browser automation requests into precise Playwright CLI commands. It covers opening/navigating pages, interacting with elements, capturing snapshots and PDFs, working with tabs, inspecting console and network activity, mocking routes, and managing cookies, storage, and browser sessions. This enables testers to convert intent into repeatable, command-driven automation quickly.

How This Skill Works

It parses a natural-language request into atomic actions, maps each action to a first-class command from references/command-map.md, and executes them in order. It recommends taking element refs snapshots before ref-based actions, re-running snapshots after major DOM changes, and using eval/run-code only when no native command exists. For artifacts, it guides saving with explicit filenames and status-aware naming, including a compressed JPEG path for screenshots.

When to Use It

Open a URL and navigate to a specific page or route
Interact with page elements (click, fill, hover, etc.) and validate results
Capture screenshots or PDFs and compare against baselines
Inspect console/network activity or mock network routes during debugging
Configure browser sessions, manage cookies/storage/state, or run named Playwright CLI sessions

Quick Start

Step 1: Verify the CLI is available: command -v playwright-cli
Step 2: Open a browser session: playwright-cli open https://example.com
Step 3: Capture refs before ref-based commands: playwright-cli snapshot

Best Practices

Verify the CLI is available before running automation
Capture a snapshot before actions that require element references and after major DOM changes
Save artifacts with explicit, filesystem-safe filenames
Prefer native playwright-cli commands; use eval/run-code only when necessary
Keep session names short and compliant (lowercase, alphanumeric, <=12 chars)

Example Use Cases

Open https://example.com, navigate to /login, click the Sign In button, and capture a final screenshot
Open a named session s1, set a cookie, reload the page, and verify the cookie is present
Navigate to a page, fill in a form field, and take a compressed JPEG screenshot named pass-form.jpg
Intercept a network request to /api/data, mock a response, and verify the UI reflects the mock data
Capture a PDF of the current page after navigating to /reports and save as report-pass.pdf

Frequently Asked Questions

Add this skill to your agents