MAI-UI

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Installation

Run this command in your terminal to add the MCP server to Claude Code.

Run in terminal:

Command

claude mcp add --transport stdio tongyi-mai-mai-ui python -m mai_ui.server \
  --env MAI_UI_CONFIG="Path to MAI-UI server config (e.g., config.yaml or config.json if required)" \
  --env MAI_UI_MODEL_PATH="Path to MAI-UI model weights (if applicable)"

How to use

MAI-UI is a GUI agent foundation model family designed to operate across devices and the cloud, enabling an agent to interact with a user and leverage MCP tools to accomplish tasks. The server exposes a set of capabilities that let you issue natural language instructions, which MAI-UI interprets to perform GUI-grounded reasoning, navigation, and cross-application actions via MCP tooling. Typical usage involves invoking the MCP call surface (for example mcp_call) to access tools or plugins (such as map navigation, shopping, calendar, or other app integrations) and then guiding the agent through a task sequence with human-in-the-loop prompts when needed. You can observe demos where the agent asks clarifying questions, plans multi-step workflows, and delegates subtasks to appropriate tools, coordinating device-local computation with cloud-backed models when data or capabilities exceed device limits.

How to install

Prerequisites:

Python 3.8+ installed on your system
Access to the internet to fetch dependencies
Optional: a compatible GPU for accelerated inference

Create and activate a virtual environment (recommended): python -m venv venv source venv/bin/activate # macOS/Linux venv\Scripts\activate # Windows
Install dependencies (adjust if a requirements file exists in the repo): pip install -r requirements.txt

or install directly if no requirements file:

pip install numpy torch transformers pillow
Install MAI-UI package (if published to PyPI or from source):

From PyPI

pip install mai-ui

From source (if applicable)

git clone https://github.com/tongyi-mai/MAI-UI.git cd MAI-UI pip install -e .
Configure the server:
- Create a config file (config.yaml or config.json) if required by the MAI-UI server.
- Set environment variables as needed (see mcp_config env section).
Run the MCP server (example): python -m mai_ui.server
Verify the server is running by sending a test MCP request to the endpoint configured by the server (often http://localhost:8000 or a defined port).

Additional notes

Tips and common issues:

Ensure Python dependencies compatible with your CUDA/cuDNN setup if you use GPU acceleration.
If the server fails to start due to missing models, verify the paths in MAI_UI_CONFIG and MAI_UI_MODEL_PATH environment variables.
For MCP tooling, you may need to expose network access to MCP tool endpoints (e.g., map services, calendar, or shopping APIs) and provide necessary API keys or tokens.
Monitor logs for prompts where the agent requests clarification; providing precise task instructions improves task success rates.
If you encounter permission errors, run the server with appropriate user permissions or adjust file permissions on model weights and config files.