web-eval-agent

An MCP server that autonomously evaluates web applications.

Installation

Run this command in your terminal to add the MCP server to Claude Code.

Run in terminal:

Command

claude mcp add --transport stdio refreshdotdev-web-eval-agent uvx --refresh-package webEvalAgent --from git+https://github.com/Operative-Sh/web-eval-agent.git webEvalAgent \
  --env OPERATIVE_API_KEY="<YOUR_KEY>"

How to use

The web-eval-agent MCP server provides an automated web evaluation workflow that lets you run a browser-based agent to navigate, test, and report on your web application directly from your development environment. This toolset includes web_eval_agent, which drives a browser to perform UX-focused tests, capture console logs, and collect network activity, returning a rich report you can inspect inside your editor or in your API responses. It also includes setup_browser_state to initialize and reuse login state across runs. You can trigger these tools from your IDE chat or command prompts, for example asking the agent to navigate to a URL, perform a signup flow, or verify a particular user journey, and then retrieve a detailed report with steps, screenshots, and logs.

How to install

Prerequisites:

A development environment with a code editor and internet access
Node.js and npm (for related tooling if needed)
UV (Astral UV) for managing MCP tools
Playwright in the environment (for browser automation)

Installation steps:

Get an API key from operative.sh/mcp and prepare your OPERATIVE_API_KEY.
Install UV (follow the official UV installation instructions):

curl -LsSf https://astral.sh/uv/install.sh | sh

Install Playwright and any required browsers:

npm install -g playwright
uvx --with playwright playwright install --with-deps

Add the MCP server configuration to your editor or project using the provided JSON, replacing <YOUR_KEY> with your actual API key:

"web-eval-agent": {
  "command": "uvx",
  "args": [
    "--refresh-package",
    "webEvalAgent",
    "--from",
    "git+https://github.com/Operative-Sh/web-eval-agent.git",
    "webEvalAgent"
  ],
  "env": {
    "OPERATIVE_API_KEY": "<YOUR_KEY>"
  }
}

Restart your editor and follow the Quick Start flow to initialize and start using the web_eval_agent tools.

Additional notes

Tips and common issues:

You’ll need an OPERATIVE_API_KEY to unlock the full capabilities; store it securely and do not commit to source control.
The web-eval-agent relies on Playwright to drive a browser; ensure browsers are installed and accessible in your environment.
If updates are not reflected in your editor, run uv cache clean or re-run the MCP install steps to pull the latest package.
You can trigger tools via IDE chat or through prompts, such as: "Test my app at http://localhost:3000. Use web-eval-agent." The agent will navigate, interact with elements, capture logs, and generate a UX report.
Ensure headless_browser is set as needed (default is non-headless) to inspect UI during development.
The setup_browser_state tool can help you sign in once and reuse cookies/local-storage for subsequent runs.

Related MCP Servers

browser-use

807

Browse the web, directly from Cursor etc.

web-agent-protocol

492

🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support

mcp-tool-kit

107

Agentic abstraction layer for building high precision vertical AI agents written in python for Model Context Protocol.

fhir

FHIR MCP Server – helping you expose any FHIR Server or API as a MCP Server.

unitree-go2

The Unitree Go2 MCP Server is a server built on the MCP that enables users to control the Unitree Go2 robot using natural language commands interpreted by a LLM.

VibeShift

[MCP Server] The Security Agent for AI assisted coding