What is WebMCP and why use it?

WebMCP is a browser API that exposes page tools through window.navigator.modelContext, allowing AI agents to discover, inspect, and invoke tools directly on websites without DOM scraping.

How do I discover tools on a page?

Tools are registered with provideContext() or registerTool(); the browser mediates access and surfaces tools with a name, description, inputSchema, and execute callback.

Can tools require user confirmation?

Yes. Tools may call agent.requestUserInteraction() to prompt the user; if the user cancels, the tool should abort without retries.

WebMCP

Verified

@brunobuddy

npx machina-cli add skill @brunobuddy/webmcp --openclaw

Files (1)

SKILL.md

4.8 KB

WebMCP — Discover and Use Website Tools

What is WebMCP

WebMCP is a browser API that lets websites expose JavaScript functions as structured tools for AI agents. Pages register tools via window.navigator.modelContext, each with a name, description, JSON Schema input, and an execute callback. Think of it as an MCP server running inside the web page itself.

Spec: https://github.com/webmachinelearning/webmcp

Detecting WebMCP Support

Before interacting with tools, check whether the current page supports WebMCP:

const supported = "modelContext" in window.navigator;

If false, the page does not expose WebMCP tools — fall back to DOM interaction or actuation.

Discovering Available Tools

Tools are registered by the page via provideContext() or registerTool(). The browser mediates access. To list available tools from an agent's perspective, evaluate:

// Browser-specific — the exact discovery API depends on the agent runtime.
// Typically the browser exposes registered tools to connected agents automatically.
// From page-script perspective, tools are registered like this:
window.navigator.modelContext.provideContext({
  tools: [
    {
      name: "tool-name",
      description: "What this tool does",
      inputSchema: { type: "object", properties: { /* ... */ }, required: [] },
      execute: (params, agent) => { /* ... */ }
    }
  ]
});

Key points:

Each tool has name, description, inputSchema (JSON Schema), and execute.
provideContext() replaces all previously registered tools (useful for SPA state changes).
registerTool() / unregisterTool() add/remove individual tools without resetting.
Tools may change as the user navigates or as SPA state updates — re-check after page transitions.

Tool Schema Format

Tool input schemas follow JSON Schema (aligned with MCP SDK and Prompt API tool use):

{
  name: "add-stamp",
  description: "Add a new stamp to the collection",
  inputSchema: {
    type: "object",
    properties: {
      name: { type: "string", description: "The name of the stamp" },
      year: { type: "number", description: "Year the stamp was issued" },
      imageUrl: { type: "string", description: "Optional image URL" }
    },
    required: ["name", "year"]
  },
  execute({ name, year, imageUrl }, agent) {
    // Implementation — updates UI and app state
    return {
      content: [{ type: "text", text: `Stamp "${name}" added.` }]
    };
  }
}

Invoking Tools

When connected as an agent, send a tool call by name with parameters matching inputSchema. The execute callback runs on the page's main thread, can update the UI, and returns a structured response:

// Response format from execute():
{
  content: [
    { type: "text", text: "Result description" }
  ]
}

Tools run sequentially on the main thread (one at a time).
execute may be async (returns a Promise).
The second parameter agent provides agent.requestUserInteraction() for user confirmation flows.

User Interaction During Tool Execution

Tools can request user confirmation before sensitive actions:

async function buyProduct({ product_id }, agent) {
  const confirmed = await agent.requestUserInteraction(async () => {
    return confirm(`Buy product ${product_id}?`);
  });
  if (!confirmed) throw new Error("Cancelled by user.");
  executePurchase(product_id);
  return { content: [{ type: "text", text: `Product ${product_id} purchased.` }] };
}

Always respect user denials — do not retry cancelled tool calls.

Agent Workflow

Navigate to the target website.
Check "modelContext" in window.navigator to confirm WebMCP support.
Discover registered tools (names, descriptions, schemas).
Select the appropriate tool based on the user's goal and the tool description.
Invoke with correct parameters matching inputSchema.
Read the structured response and relay results to the user.
After SPA navigation or state changes, re-discover tools — the set may have changed.
If no WebMCP tool fits the task, fall back to DOM-based interaction.

Important Constraints

Browser context required — tools only exist in a live browsing context (tab/webview), not headlessly.
Sequential execution — tool calls run one at a time on the main thread.
No cross-origin tool sharing — tools are scoped to the page that registered them.
Permission-gated — the browser may prompt the user before allowing tool access.
Tools are dynamic — SPAs may register/unregister tools based on UI state.

Source

git clone https://clawhub.ai/brunobuddy/webmcpView on GitHub

Overview

WebMCP exposes JavaScript tools on websites via window.navigator.modelContext. It lets agents discover, inspect, and invoke tools without DOM scraping. This structured approach enables reliable automation on pages that expose tools through MCP-style APIs.

How This Skill Works

WebMCP tools are registered by pages using provideContext or registerTool, each with a name, description, inputSchema, and an execute callback. The agent checks for modelContext support, discovers the available tools, and calls a tool by name with parameters matching inputSchema. The execute callback runs on the page's main thread and returns a structured response that the agent can display or act on.

When to Use It

Automating a site that exposes tools via WebMCP (window.navigator.modelContext) rather than DOM scraping.
When you need structured inputs/outputs through JSON Schema instead of brittle UI interactions.
Dealing with SPA navigation where tools are re-registered as the page state changes.
Performing sensitive actions only after user confirmation via agent.requestUserInteraction.
Integrating workflows across pages that expose MCP-style tools through window.navigator.modelContext.

Quick Start

Step 1: Load the target page and check for WebMCP support by evaluating if 'modelContext' exists in window.navigator.
Step 2: Discover available tools via provideContext/registerTool and inspect their inputSchema.
Step 3: Call a tool by name with parameters that match inputSchema and handle the structured execute response.

Best Practices

Check for WebMCP support before attempting to discover tools.
Prefer invoking tools (execute) over interacting with the DOM for reliability.
Always align your input with the tool's inputSchema and validate fields.
Handle async execute callbacks with await and manage potential errors gracefully.
Respect user consent flows: honor cancellations from agent.requestUserInteraction and avoid retries after denial.

Example Use Cases

Call a site's add-stamp tool to add a new stamp with name, year, and optional imageUrl.
Invoke a tool like getProductInfo to retrieve details without parsing the page DOM.
Use a translate tool to convert on-page text via the tool instead of manual scraping.
Trigger a site navigation or state-change tool to move to a specific SPA state.
Run a purchase tool that asks for user confirmation before completing a transaction.

Frequently Asked Questions

Add this skill to your agents