webfetch
Live Web Access for Your Local AI — Tunable Search & Clean Content Extraction
claude mcp add --transport stdio manooll-webfetch-mcp node /full/path/to/webfetch-mcp/server.mjs \ --env DEBUG="false" \ --env SEARXNG_BASE="http://localhost:8080"
How to use
WebFetch.MCP is a self-hosted MCP server that provides live web access for local AI workflows. It uses SearxNG as a backend search and Mozilla Readability-like content extraction to deliver clean, structured text suitable for LLM consumption. With this server, you can perform configurable web searches across 70+ engines, fetch full article content from URLs, and fine-tune results by source, data type, language, and freshness. The MCP is designed to work with LM Studio and can be plugged into your local toolset without API keys, quotas, or external tracking. After starting the server, you can connect it to LM Studio (or other MCP clients) by adding an entry for webfetch that points to the local Node.js process, enabling “web_search” and “web_fetch” tools for your experiments and research tasks.
How to install
Prerequisites:
- Node.js 18+ installed on your system
- Git installed
- (Optional) Docker and Docker Compose if you prefer a containerized setup
-
Clone the repository
- git clone https://github.com/manull/webfetch-mcp.git
- cd webfetch-mcp
-
Install dependencies
- npm install
-
Run the server
- node server.mjs
- Or use a process manager (recommended for production):
- pm2 start server.mjs --name webfetch
-
Connect to LM Studio or another MCP client
- In LM Studio, add an MCP server entry with the following configuration: { "mcpServers": { "webfetch": { "command": "node", "args": ["/full/path/to/webfetch-mcp/server.mjs"], "env": { "SEARXNG_BASE": "http://localhost:8080", "DEBUG": "false" } } } }
-
Validate
- Test a quick search: curl "http://localhost:8080/search?format=json&q=test&count=5"
- Test a fetch: curl "http://localhost:8080/fetch?url=https://example.com/article"
Additional notes
Tips and considerations:
- SEARXNG_BASE should point to your running SearxNG instance. If you’re running SearxNG in a Docker setup, ensure the container is accessible at the specified URL.
- The server relies on Mozilla Readability for clean extraction; some pages may require alternative handling for highly dynamic sites.
- Rate limiting is designed for research use; monitor usage to avoid overwhelming your local search engine or the target sites.
- If you run into CORS or network issues, verify that localhost ports (e.g., 8080 for SearxNG) are accessible from the server process.
- You can customize available engines and language/region preferences through the configuration exposed by the MCP server and the LM Studio settings.
Related MCP Servers
firecrawl
🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.
robloxstudio
Create agentic AI workflows in ROBLOX Studio
web-scout
A powerful MCP server extension providing web search and content extraction capabilities. Integrates DuckDuckGo search functionality and URL content extraction into your MCP environment, enabling AI assistants to search the web and extract webpage content programmatically.
Pare
Dev tools, optimized for agents. Structured, token-efficient MCP servers for git, test runners, npm, Docker, and more.
music21
🎵 Production-ready MCP server for music analysis & generation | FastMCP • music21 • OAuth2 • Docker | First music21 MCP integration with enterprise features
graphql -bridge
A bridge implementation connecting GraphQL APIs with the Model Context Protocol (MCP), enabling seamless integration between GraphQL services and MCP-compatible AI systems. This tool facilitates data exchange and API communication by translating GraphQL operations into MCP-compatible formats.