Get the FREE Ultimate OpenClaw Setup Guide →

multimodal -client

A Multi-modal MCP client for voice powered agentic workflows

Installation
Run this command in your terminal to add the MCP server to Claude Code.
Run in terminal:
Command
claude mcp add --transport stdio ejb503-multimodal-mcp-client npx -y multimodal-mcp-client \
  --env VITE_GEMINI_API_KEY="Gemini API key placeholder" \
  --env VITE_SYSTEMPROMPT_API_KEY="Systemprompt API key placeholder"

How to use

The multimodal MCP client acts as the frontend interface that connects to MCP servers to orchestrate voice-driven, multimodal AI workflows. It enables you to interact with AI systems using natural speech, while supporting text and visual inputs through a unified MCP-based tooling system. The client works with both Systemprompt MCP servers (pre-configured and installable via the UX) and custom MCP servers you provide in a local mcp.config.custom.json. After configuring authentication keys and the MCP server you want to use, you launch the development server to start speaking to your AI workflows and invoke tools defined by the MCP server.

Within this client, you can configure and run custom tools and workflows exposed by MCP servers, manage state across multi-step interactions, and leverage the built-in multimodal capabilities to process voice, text, and visuals in real time. The tooling system lets you extend AI capabilities by adding domain-specific operations, while the voice-first design makes it practical to control complex AI pipelines using natural language commands and prompts. The client is designed to be developer-friendly, with TypeScript support, hot module replacement, and a modern tech stack to streamline integration with your own MCP servers.

How to install

Prerequisites:

  • Node.js 16.x or higher
  • npm 7.x or higher
  • A modern browser with Web Speech API support

Install and run locally:

  1. Clone the repository
git clone https://github.com/Ejb503/multimodal-mcp-client.git
cd multimodal-mcp-client
  1. Install dependencies
npm install
  1. Install proxy dependencies (if applicable)
cd proxy
npm install
  1. Create and configure local MCP server config
# Navigate to config directory
cd config

# Create or copy local configuration
cp mcp.config.example.json mcp.config.custom.json
  1. Add API keys to environment (example keys)
# In .env or your environment, with VITE_ prefix to expose to MCP
VITE_GEMINI_API_KEY=your-gemini-api-key
VITE_SYSTEMPROMPT_API_KEY=your-systemprompt-api-key
  1. Start the development server
npm run dev
  1. Open the app in your browser at http://localhost:5173

Additional notes

Tips and common issues:

  • Ensure your API keys are properly set with the VITE_ prefix so they are exposed to the MCP client and server.
  • If you’re using a custom MCP server, place its config in config/mcp.config.custom.json and ensure the path to any local server entry (e.g., index.js) is correct.
  • The client relies on Web Speech API; some browsers or environments may require permissions or additional setup to enable voice input.
  • When using the npx-based config, you can replace the package-name with your actual MCP client package name if you publish it under a different registry name.
  • If you encounter port conflicts, verify that the development server port (default 5173) is not used by another process.
  • For production deployments, consider bundling with the appropriate build command and hosting the static assets with a suitable web server.

Related MCP Servers

Sponsor this space

Reach thousands of developers