web-retrieval
一个专门用于解析网页设计结构的 Model Context Protocol (MCP) 工具。通过提供URL,即可获得详细的页面结构分析,包括布局、导航、内容区域、表单、图片等元素的完整信息。
claude mcp add --transport stdio guangxiangdebizi-web-retrieval-mcp node path/to/web-retrieval-mcp/build/index.js
How to use
The Web Retrieval MCP is designed to analyze and extract the frontend structure of any given web page. By supplying a URL, you can obtain a detailed report of the page layout, navigation, content areas, forms, images, and link relationships, along with style features such as responsive behavior. This makes it easier to understand how a site is built and how its various components interact, which is useful for design analysis, accessibility reviews, and frontend-backend integration planning.
Once deployed, you can access the MCP through either the Stdio (local development) mode or the SSE mode via Supergateway. Use the analyze_web_structure tool to run a deep structural analysis on a target URL. The tool will return structured data including layout sections (header, footer, sidebar), heading hierarchy (H1-H6), navigation structure, primary content regions, forms and their fields, image metadata, and API/HTTP endpoints discovered on the page. You can also see evidence of CSS frameworks or frontend tooling clues to infer the technology stack behind the page.
How to install
Prerequisites:
- Node.js and npm installed on your machine
- Basic familiarity with MCP workflow and CLI usage
Install the project dependencies:
npm install
Build the project (production-ready bundle):
npm run build
Run the MCP server locally (Stdio mode):
npm start
Optionally, run via an SSE gateway (if you have Supergateway installed):
npm install -g supergateway
npm run sse
Configure Claude (MCP client) to point to the server using the Stdio or SSE configuration shown in the README. For Stdio, reference the built index in path/to/web-retrieval-mcp/build/index.js. For SSE, point to http://localhost:3100/sse with an appropriate timeout.
Additional notes
Notes and tips:
- This MCP has a focus on extracting page structure, including layout sections, navigation, content regions, forms, and image resources.
- Timeouts: The service should set reasonable timeouts for page fetching (e.g., 10 seconds) to avoid long waits.
- User-Agent: A standard browser User-Agent helps with compatibility during page fetches.
- Resource limits: Consider setting maximums for the number of links, forms, and images to avoid excessive memory usage on large pages.
- Security: Validate and sanitize all input URLs and avoid following redirects indefinitely.
- Output: The analyze_web_structure tool returns a structured report that can be extended with custom fields if you integrate with MCP SDK.
Related MCP Servers
mcp-ui
UI over MCP. Create next-gen UI experiences with the protocol and SDK!
mcp-graphql
Model Context Protocol server for GraphQL
mcp-package-version
An MCP server that provides LLMs with the latest stable package versions when coding
leetcode
An MCP server enabling automated access to LeetCode's problems, solutions, and public data with optional authentication for user-specific features, supporting leetcode.com & leetcode.cn sites.
rohlik
MCP server that lets you shop groceries across the Rohlik Group platforms (Rohlik.cz, Knuspr.de, Gurkerl.at, Kifli.hu, Sezamo.ro)
hackmd
A Model Context Protocol server for integrating HackMD's note-taking platform with AI assistants.