KoboldCPP
AI to AI comms with koboldcpp from Claude/other MCP compatible apps
claude mcp add --transport stdio phialsbasement-koboldcpp-mcp-server node dist/index.js \ --env NODE_ENV="production" \ --env KOBOLD_API_KEY="your-api-key-if-needed" \ --env KOBOLD_API_URL="http://localhost:5001"
How to use
Kobold MCP Server provides an MCP-compatible interface to KoboldAI, enabling text generation, chat completion with memory, and OpenAI-compatible endpoints. Once running, clients can send requests through the MCP protocol to perform standard text generation, chat with persistent context, or leverage additional capabilities like stable diffusion, audio transcription, and TTS via the KoboldAI integration. The server is implemented in TypeScript and packages the KoboldAI interaction logic behind MCP endpoints, so applications that expect MCP can interact without direct KoboldAI-specific calls. To get started, install the npm package, configure the KoboldAI API endpoint, and launch the server. You can then connect your MCP client to the server-name and issue requests as you would with other MCP servers, including text completion, chat, and memory-enabled conversations. The server also exposes OpenAI-like text completion endpoints to ease integration with tools expecting OpenAI-compatible requests.
How to install
Prerequisites:
- Node.js v16 or higher
- npm or yarn
- Access to a KoboldAI instance (running and reachable via HTTP API)
Install the MCP server package:
npm install kobold-mcp-server
If you plan to run from source, clone and build:
git clone https://github.com/yourusername/kobold-mcp-server.git
cd kobold-mcp-server
npm install
npm run build
Configure and run:
- Create a configuration file or set environment variables to point to your KoboldAI API endpoint.
- Start the server (the package exports the Kobold MCP server class and you can run via your own Node script or use a runner that imports the package).
Example run with a simple Node script:
// run-server.js
import { KoboldMCPServer } from 'kobold-mcp-server';
const server = new KoboldMCPServer({ apiUrl: 'http://localhost:5001' });
server.start();
Then execute:
node run-server.js
Alternatively, if you expose a CLI in your environment, you can start it using that mechanism after building.
Additional notes
Environment variables and configuration options:
- KOBOLD_API_URL: URL of your KoboldAI API endpoint (required for the server to route requests)
- KOBOLD_API_KEY: Optional API key if your KoboldAI deployment requires authentication
- NODE_ENV: Set to production for optimized runtime Common issues:
- Ensure KoboldAI is running and accessible from the MCP server host
- Verify CORS/settings on the KoboldAI API if you encounter request failures
- If using Stable Diffusion or audio features, confirm the respective services are reachable and properly configured Configuration tips:
- Use a dedicated network or container for KoboldAI and the MCP server to reduce latency
- Monitor logs for request routing errors to identify misconfigured API URLs
Related MCP Servers
zen
Selfhosted notes app. Single golang binary, notes stored as markdown within SQLite, full-text search, very low resource usage
MCP -Deepseek_R1
A Model Context Protocol (MCP) server implementation connecting Claude Desktop with DeepSeek's language models (R1/V3)
mcp-fhir
A Model Context Protocol implementation for FHIR
mcp
Inkdrop Model Context Protocol Server
mcp-appium-gestures
This is a Model Context Protocol (MCP) server providing resources and tools for Appium mobile gestures using Actions API..
dubco -npm
The (Unofficial) dubco-mcp-server enables AI assistants to manage Dub.co short links via the Model Context Protocol. It provides three MCP tools: create_link for generating new short URLs, update_link for modifying existing links, and delete_link for removing short links.