bluebox
Index the world's undocumented APIs
claude mcp add --transport stdio vectorlyapp-bluebox python -m bluebox \ --env OPENAI_API_KEY="your-openai-api-key" \ --env OPENAI_API_BASE="optional-custom-openai-base-url" \ --env ANTHROPIC_API_KEY="your-anthropic-api-key" \ --env VECTORLY_API_BASE="optional-vectorly-base-url" \ --env ANTHROPIC_API_BASE="optional-custom-anthropic-base-url" \ --env VECTORLY_SERVICE_TOKEN="your-vectorly-api-key"
How to use
Bluebox is a Python-based MCP server that runs the bluebox agent to perform web data extraction by interpreting natural language requests, mapping them to pre-built routines, and orchestrating multiple routines in parallel. The agent can fall back to an AI browser-driven workflow for tasks without predefined routines, post-process results with Python (CSV, JSON, etc.), and save outputs to a local workspace. Use this server when you need to programmatically query undocumented or UI-driven web APIs and automate complex data extraction tasks. After starting, you can interact with the agent to request analyses like price checks, data compilation from multiple web sources, or routine-driven scrapes, and you can replay successful sessions via generated context files to reproduce results quickly in future runs.
How to install
Prerequisites:
- Python 3.12+ installed on your system
- Access to Vectorly API key (VECTORLY_SERVICE_TOKEN)
- An API key for an LLM provider (OPENAI_API_KEY or ANTHROPIC_API_KEY) depending on your choice
- Optional: uv for dependency management (recommended for faster installs)
Installation steps:
- Clone the repository:
git clone https://github.com/VectorlyApp/bluebox.git
cd bluebox
- Create and activate a Python virtual environment:
python3 -m venv bluebox-env
source bluebox-env/bin/activate # On Windows: bluebox-env\Scripts\activate
- Install the package in editable mode:
pip install -e .
- Optional: install with uv for faster dependency handling:
# If you have uv installed or install it first
uv venv bluebox-env
source bluebox-env/bin/activate
uv pip install -e .
- Set required environment variables (see prerequisites) and run the server via the MCP config (see mcp_config section):
# Example launching via the MCP system (depends on your MCP orchestrator)
# This command assumes the MCP runner reads the Python module entrypoint as shown in mcp_config
python -m bluebox
Additional notes
- The bluebox agent requires a Vectorly API key and an LLM provider key. Ensure these are kept secure and not committed to version control.
- If you use OpenAI, set OPENAI_API_KEY in your environment or a .env file; if you prefer Anthropic, set ANTHROPIC_API_KEY instead.
- For dependency management, uv can speed up environment setup, but is optional.
- The agent can produce a local workspace with generated outputs; ensure your filesystem has enough space for large post-processed results.
- To replay a successful session, use the /generate_context mechanism described in the README and then load the context file on a new run.
Related MCP Servers
klavis
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
web-agent-protocol
🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support
ollama -bridge
Extend the Ollama API with dynamic AI tool integration from multiple MCP (Model Context Protocol) servers. Fully compatible, transparent, and developer-friendly, ideal for building powerful local LLM applications, AI agents, and custom chatbots
browserai
A powerful Model Context Protocol (MCP) server that provides an access to serverless browser for AI agents and apps
puppeteer
Self-hosted Puppeteer MCP server with remote SSE access, API key authentication, and Docker deployment. Complete tool suite for browser automation via Model Context Protocol.
mcp
🌐 MCP Server for Semantic Search of Macro Data for Quant Research AI Agents