scrapy
MCP server that enables self-healing automatic repair of Scrapy spiders. When websites change, your scrapers fix themselves.
How to use
The Scrapy MCP server is a powerful tool that enables self-healing automatic repair of Scrapy spiders. When websites change, this server allows your scrapers to adapt and fix themselves, minimizing downtime and maintenance effort. Developers can leverage its capabilities to ensure their scraping operations remain resilient against changes in target websites.
Once connected to the Scrapy MCP server, you can interact with it by sending specific commands to monitor and repair your Scrapy spiders. The server intelligently detects changes in website structures and automatically generates updates to your scraping logic. You can issue commands to check the status of your spiders or initiate repairs based on the detected changes in the content structure.
How to install
Prerequisites
Before installing the Scrapy MCP server, ensure you have the following prerequisites:
- Node.js: Version 12 or higher
- Python: Version 3.6 or higher
Option A: Quick start with npx
You can quickly start using the Scrapy MCP server with the following command:
npx -y scrapoxy/scrapy-mcp-server
Option B: Global install alternative
Alternatively, you can install the server globally using npm:
npm install -g scrapoxy/scrapy-mcp-server
Additional notes
When configuring the Scrapy MCP server, set the SCRAPY_MCP_PORT environment variable to specify the port on which the server should listen. A common issue to watch out for is resource allocation; ensure your server has sufficient memory to handle multiple spider repairs concurrently, particularly if you are scraping multiple websites simultaneously.
Related MCP Servers
Scrapling
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
exa
Exa MCP for web search and web crawling!
Wax
Sub-Millisecond RAG on Apple Silicon. No Server. No API. One File. Pure Swift
wanaku
Wanaku MCP Router
furi
CLI & API for MCP management
mcp-auth-proxy
MCP Auth Proxy is a secure OAuth 2.1 authentication proxy for Model Context Protocol (MCP) servers