WebHive
Meet WebHive, the AI-powered browser that takes care of tasks for you. No more endless clicks, tell it what you need, and it gets it done.
claude mcp add --transport stdio webhiveos-webhive python webhive.py --ip 127.0.0.1 --port 7788 \ --env RESOLUTION="1920x1080x24" \ --env VNC_PASSWORD="optional_vnc_password" \ --env GOOGLE_API_KEY="your_google_api_key" \ --env OPENAI_API_KEY="your_openai_api_key" \ --env RESOLUTION_WIDTH="1920" \ --env ANTHROPIC_API_KEY="your_anthropic_api_key" \ --env RESOLUTION_HEIGHT="1080" \ --env CHROME_PERSISTENT_SESSION="true_or_false"
How to use
WebHive is a Python-based smart browser control center that lets you orchestrate AI-enabled browser tasks across multiple models and services while optionally using your own browser profiles. It supports running with a local server interface, access to an embedded UI, and integration with common AI providers like OpenAI and Anthropic through API keys. The server exposes a web-facing interface at the configured IP and port and can persist the browser session to maintain history between tasks when CHROME_PERSISTENT_SESSION is enabled. This makes it suitable for automation workflows, data collection, and browser-driven AI tasks where you want to preserve state between runs. Use the provided environment configuration to supply your API keys and browser-related settings, then start the server to begin issuing automated tasks or running AI agents inside a controlled browser context.
How to install
Prerequisites:
- Python 3.11+ installed on your system
- Git installed
- Optionally Docker if you prefer containerized usage
Local installation steps:
- Clone the repository:
git clone git@github.com:webhiveos/WebHive.git
cd WebHive
- Set up Python environment (uv recommended):
uv venv --python 3.11
Activate the environment:
- Windows (Command Prompt):
.venv\Scripts\activate
- Windows (PowerShell):
.\ .venv\Scripts\Activate.ps1
- macOS/Linux:
source .venv/bin/activate
- Install Python dependencies:
uv pip install -r requirements.txt
- Install Playwright (browser automation dependencies):
playwright install
- Create and configure environment file:
cp .env.example .env
Edit .env to include your API keys and settings. 6. Run the WebHive server:
python webhive.py --ip 127.0.0.1 --port 7788
Docker installation (optional):
- Ensure Docker and Docker Compose are installed.
- Clone the repository and configure the .env file as above.
- Build and start the container:
docker compose up --build
- Access the interface at http://localhost:7788.
Additional notes
Tips and common considerations:
- Provide API keys for the LLM providers you intend to use (OPENAI_API_KEY, ANTHROPIC_API_KEY, GOOGLE_API_KEY).
- CHROME_PERSISTENT_SESSION controls whether the browser stays open between AI tasks; set to true if you want to preserve context, otherwise false for a fresh session each task.
- VNC_PASSWORD can be set in the .env file to secure VNC access; the default is optional but can be overridden.
- The default HTTP interface is bound to 127.0.0.1:7788; adjust --ip/--port if running behind a reverse proxy or on a different host.
- ARM64 builds are supported in Docker; ensure you pull the appropriate image for your architecture.
- If you encounter browser automation issues, ensure dependencies (e.g., Playwright) are installed and the environment is correctly activated.
Related MCP Servers
lobehub
The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.
serena
A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)
Upsonic
Agent Framework For Fintech and Banks
magic
Super Magic. The first open-source all-in-one AI productivity platform (Generalist AI Agent + Workflow Engine + IM + Online collaborative office system)
casibase
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
agents
Create AI Agents in a No-Code Visual Builder or TypeScript SDK with full 2-way sync. For shipping AI assistants and multi-agent AI workflows.