crawlbase
Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and Windsurf integrations with battle-tested web scraping, JavaScript rendering, and anti-bot protection enabling structured, live data inside your AI workflows.
claude mcp add --transport stdio crawlbase-crawlbase-mcp npx @crawlbase/mcp@latest \ --env CRAWLBASE_TOKEN="your_token_here" \ --env CRAWLBASE_JS_TOKEN="your_js_token_here"
How to use
Crawlbase MCP serves as a bridge between AI agents and live web content. It exposes tools that fetch and render real-time web data, including raw HTML, clean Markdown, and visual screenshots, by leveraging Crawlbase's crawling infrastructure. After configuring tokens, you can invoke commands within your MCP-enabled client (such as Claude, Cursor, or Windsurf) to request live content and have it returned in structured formats that are easier for your agents to consume. The available actions you’ll typically use are crawl to fetch HTML, crawl_markdown to extract and deliver clean, structured Markdown, and crawl_screenshot to capture page visuals. This enables AI workflows to query fresh information, render pages, or obtain compact, human-friendly representations of live websites.
To start, ensure you have tokens from Crawlbase. Then run the MCP server using your client’s standard MCP configuration (for example, in Claude or Cursor you’ll point to the npm runable MCP via npx @crawlbase/mcp@latest with the appropriate environment variables). When you issue a prompt that involves web data, instruct the agent to call crawl, crawl_markdown, or crawl_screenshot as needed, and the MCP server will handle crawling, rendering, and returning the data to your agent for further reasoning or display.
How to install
Prerequisites:
- Node.js and npm/yarn installed on your machine
- Access tokens from Crawlbase (CRAWLBASE_TOKEN and CRAWLBASE_JS_TOKEN)
- Basic familiarity with your MCP client (Claude, Cursor, Windsurf) configuration
Install steps:
-
Install the MCP server package (via npx as shown in the readme):
- Ensure you have an environment with Node.js installed.
-
Run the MCP server in your environment:
- Example (HTTP token-based run):
- Set environment variables:
- CRAWLBASE_TOKEN=your_token
- CRAWLBASE_JS_TOKEN=your_js_token
- npm install (if needed) and then start via your client configuration as shown in the readme, e.g., using npx @crawlbase/mcp@latest
- Set environment variables:
- Example (HTTP token-based run):
-
Verify the server is up:
- Use the HTTP endpoints described in the readme if you choose HTTP mode (POST /mcp, GET /health).
-
Integrate with your MCP client:
- For Claude/Cursor/Windsurf, add the crawlbase entry in mcpServers with type stdio and the same npm command and tokens as described in the configuration examples.
Note: In HTTP mode, you can share a single server across multiple tenants by supplying per-request tokens in headers (X-Crawlbase-Token and X-Crawlbase-JS-Token).
Additional notes
Tips and considerations:
- Use per-request authentication in HTTP mode to support multi-tenant workloads via headers. Environment variables act as defaults; headers override them per request.
- Tokens are required for both HTML and JS-rendered data; use CRAWLBASE_TOKEN for HTML and CRAWLBASE_JS_TOKEN for JavaScript-rendered content or screenshots.
- The MCP server exposes tools to fetch raw HTML (crawl), convert content to Markdown (crawl_markdown), and capture rendered pages as screenshots (crawl_screenshot).
- When configuring in IDEs (Claude, Cursor, Windsurf), keep token values secure and avoid committing them to version control.
- If you run into rate limits or token issues, verify token scopes on Crawlbase dashboard and ensure you’re using the latest @crawlbase/mcp package.
- The example configuration uses npx to fetch the latest MCP package; pin versions if you need reproducible builds.
Related MCP Servers
mcp-for-beginners
This open-source curriculum introduces the fundamentals of Model Context Protocol (MCP) through real-world, cross-language examples in .NET, Java, TypeScript, JavaScript, Rust and Python. Designed for developers, it focuses on practical techniques for building modular, scalable, and secure AI workflows from session setup to service orchestration.
mcp-arr
MCP server for *arr media management suite
charlotte
Structured web browsing MCP server — some terrific, radiant, humble web pages.
shinzo-ts
TypeScript SDK for MCP server observability, built on OpenTelemetry. Gain insight into agent usage patterns, contextualize tool calls, and analyze server performance across platforms. Integrate with any OpenTelemetry ingest service including the Shinzo platform.
mcp-json-yaml-toml
A structured data reader and writer like 'jq' and 'yq' for AI Agents
web3 -hub
The definitive open-source registry of Model Context Protocol (MCP) servers for Web3, blockchain, and decentralized applications. 130+ chains, DeFi, NFTs, analytics, and identity tools