WebHive

Meet WebHive, the AI-powered browser that takes care of tasks for you. No more endless clicks, tell it what you need, and it gets it done.

Installation

Run this command in your terminal to add the MCP server to Claude Code.

Run in terminal:

View docs

Command

claude mcp add --transport stdio webhiveos-webhive python webhive.py --ip 127.0.0.1 --port 7788 \
  --env RESOLUTION="1920x1080x24" \
  --env VNC_PASSWORD="optional_vnc_password" \
  --env GOOGLE_API_KEY="your_google_api_key" \
  --env OPENAI_API_KEY="your_openai_api_key" \
  --env RESOLUTION_WIDTH="1920" \
  --env ANTHROPIC_API_KEY="your_anthropic_api_key" \
  --env RESOLUTION_HEIGHT="1080" \
  --env CHROME_PERSISTENT_SESSION="true_or_false"

How to use

WebHive is a Python-based smart browser control center that lets you orchestrate AI-enabled browser tasks across multiple models and services while optionally using your own browser profiles. It supports running with a local server interface, access to an embedded UI, and integration with common AI providers like OpenAI and Anthropic through API keys. The server exposes a web-facing interface at the configured IP and port and can persist the browser session to maintain history between tasks when CHROME_PERSISTENT_SESSION is enabled. This makes it suitable for automation workflows, data collection, and browser-driven AI tasks where you want to preserve state between runs. Use the provided environment configuration to supply your API keys and browser-related settings, then start the server to begin issuing automated tasks or running AI agents inside a controlled browser context.

How to install

Prerequisites:

Python 3.11+ installed on your system
Git installed
Optionally Docker if you prefer containerized usage

Local installation steps:

Clone the repository:

git clone git@github.com:webhiveos/WebHive.git
cd WebHive

Set up Python environment (uv recommended):

uv venv --python 3.11

Activate the environment:

Windows (Command Prompt):

.venv\Scripts\activate

Windows (PowerShell):

.\ .venv\Scripts\Activate.ps1

macOS/Linux:

source .venv/bin/activate

Install Python dependencies:

uv pip install -r requirements.txt

Install Playwright (browser automation dependencies):

playwright install

Create and configure environment file:

cp .env.example .env

Edit .env to include your API keys and settings. 6. Run the WebHive server:

python webhive.py --ip 127.0.0.1 --port 7788

Docker installation (optional):

Ensure Docker and Docker Compose are installed.
Clone the repository and configure the .env file as above.
Build and start the container:

docker compose up --build

Access the interface at http://localhost:7788.

Additional notes

Tips and common considerations:

Provide API keys for the LLM providers you intend to use (OPENAI_API_KEY, ANTHROPIC_API_KEY, GOOGLE_API_KEY).
CHROME_PERSISTENT_SESSION controls whether the browser stays open between AI tasks; set to true if you want to preserve context, otherwise false for a fresh session each task.
VNC_PASSWORD can be set in the .env file to secure VNC access; the default is optional but can be overridden.
The default HTTP interface is bound to 127.0.0.1:7788; adjust --ip/--port if running behind a reverse proxy or on a different host.
ARM64 builds are supported in Docker; ensure you pull the appropriate image for your architecture.
If you encounter browser automation issues, ensure dependencies (e.g., Playwright) are installed and the environment is correctly activated.

Related MCP Servers

lobehub

72.8k

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

serena

20.8k

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Upsonic

7.8k

Agent Framework For Fintech and Banks

magic

4.5k

Super Magic. The first open-source all-in-one AI productivity platform (Generalist AI Agent + Workflow Engine + IM + Online collaborative office system)

casibase

4.5k

⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com

agents

969

Create AI Agents in a No-Code Visual Builder or TypeScript SDK with full 2-way sync. For shipping AI assistants and multi-agent AI workflows.