Scrapling
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
claude mcp add --transport stdio d4vinci-scrapling -- python -m scrapling
How to use
Scrapling is an adaptive web scraping framework for extracting data from websites. Its parser learns from website changes, so scraping scripts can keep working when the structure of the target site changes. Its built-in fetchers can bypass anti-bot systems such as Cloudflare Turnstile, which reduces the chance of being blocked. For larger jobs, its spider architecture supports concurrent crawling, so the same code can scale from a single request to a full crawl.
How to install
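To install Scrapling, first make sure a supported Python 3 version is installed. Recent Scrapling releases require Python 3.10 or higher; check the project's documentation for the exact minimum for the release you are installing. Then install Scrapling with pip: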
pip install scrapling
The CLI tools provided by Scrapling may need additional dependencies. See the Scrapling documentation for the exact packages and install steps.
Additional notes
Before scraping a site, check its terms of service and respect its robots.txt file. Add error handling around every request so that a failed or blocked request does not crash the whole run. The most common issue is being blocked by anti-bot measures; switching to the stealthy fetcher can help. Proxies and other settings can be configured through environment variables.
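The advice on robots.txt and error handling can be sketched with the Python standard library alone. This is a minimal illustration, not part of Scrapling's API: the function names, the "my-scraper" user agent, and the retry settings are all assumptions, and the `fetch` argument is a hypothetical placeholder for whichever fetcher call you actually use.

```python
import time
from urllib.robotparser import RobotFileParser


def build_robots_parser(robots_txt: str) -> RobotFileParser:
    """Parse the text of a robots.txt file that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def fetch_with_retries(fetch, url, parser, user_agent="my-scraper",
                       retries=3, delay=1.0):
    """Call fetch(url) only if robots.txt allows it, retrying on failure.

    `fetch` is a hypothetical callable that takes a URL and returns a
    response; it stands in for a real fetcher call.
    """
    if not parser.can_fetch(user_agent, url):
        raise PermissionError(f"robots.txt disallows {url}")

    last_error = None
    for attempt in range(retries):
        try:
            return fetch(url)
        except Exception as error:  # narrow this to your fetcher's errors
            last_error = error
            if attempt < retries - 1:
                # Exponential backoff: delay, 2*delay, 4*delay, ...
                time.sleep(delay * 2 ** attempt)
    raise RuntimeError(
        f"giving up on {url} after {retries} attempts"
    ) from last_error
```

A caller would download the site's robots.txt once, pass its text to `build_robots_parser`, and then route every request through `fetch_with_retries`. Catching a broad `Exception` keeps the sketch short; in practice it is better to catch only the errors your fetcher raises.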
Related MCP Servers
gpt-researcher
An autonomous agent that conducts deep research on any data using any LLM providers.
trigger.dev
Trigger.dev – build and deploy fully‑managed AI agents and workflows
sudocode
Lightweight agent orchestration dev tool that lives in your repo
apple-mail
MCP server giving AI assistants full access to Apple Mail - read, search, compose, organize & analyze emails via natural language
Excel -Master
Excel MCP Server - Manipulate Excel files without Microsoft Excel. Model Context Protocol for XLSX, XLSM with Claude AI integration
architect
A powerful, self-extending MCP server for dynamic AI tool orchestration. Features sandboxed JS execution, capability-based security, automated rate limiting, marketplace integration, and a built-in monitoring dashboard. Built for the Model Context Protocol (MCP).