llama_mcp_proxy
Simple Node.js proxy for llama-server that enables MCP use
claude mcp add --transport stdio extopico-llama-server_mcp_proxy npx search1api-mcp --port 0 --api-key YOUR_SEARCH1API_KEY_PLACEHOLDER --env DEBUG="true"
How to use
This MCP Proxy acts as a bridge between your llama-server instance and MCP-compatible tool servers. It intercepts chat completion requests, injects a dynamically generated system prompt listing the available tools and their descriptions, and lets the language model call tools iteratively within a single user turn.

When the LLM indicates a tool call, the proxy parses the call, executes the corresponding MCP tool via its configured client, streams progress and results back to the client, and feeds the tool results back to the LLM to continue the conversation. Streaming of the LLM's generated messages is preserved (Server-Sent Events), including intermediate statuses such as [Executing tool...] and [Tool Result: ...].

To use it, point your LLM UI at the proxy URL (for example http://localhost:9090) and make sure your llama-server is running (default http://localhost:8080). Once you have configured one or more MCP tool servers in mcp-config.json (e.g., search1api), the proxy handles tool usage automatically.
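The per-turn loop above reduces to: detect a tool call in the model's output, run the tool, append the result, and re-prompt. A minimal sketch of the detection step, assuming a hypothetical `TOOL_CALL: {...}` line convention in the injected system prompt (the proxy's actual marker format may differ; `parseToolCall` is an illustrative name, not the proxy's API):

```javascript
// Hypothetical convention: the system prompt asks the model to emit a line like
//   TOOL_CALL: {"name": "search1api", "arguments": {"query": "..."}}
// This sketches detection only; it is not the proxy's actual wire format.
function parseToolCall(llmText) {
  const match = llmText.match(/TOOL_CALL:\s*(\{.*\})/s);
  if (!match) return null; // plain assistant message, no tool requested
  try {
    const { name, arguments: args } = JSON.parse(match[1]);
    return { name, args };
  } catch {
    return null; // malformed JSON: treat as ordinary text
  }
}
```

When `parseToolCall` returns non-null, the proxy would execute the named MCP tool, stream a status like [Executing tool...], and append the result before re-prompting the model.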
How to install
Prerequisites:
- Node.js version 18 or higher
- npm (comes with Node.js)
- A running llama-server instance
- One or more MCP tool servers you want the LLM to access
Step-by-step:
- Prepare your project directory and install dependencies
mkdir llama-server-mcp-proxy
cd llama-server-mcp-proxy
npm init -y
npm install @modelcontextprotocol/sdk
- Create the proxy script (e.g., llama-server-mcp-proxy.js) and implement the proxy logic as described in the README. If you already have the script, place it in this directory.
- Create mcp-config.json describing your MCP tool servers (see the example in the README).
{
  "mcpServers": {
    "search1api": {
      "command": "npx",
      "args": [
        "search1api-mcp",
        "--port",
        "0",
        "--api-key",
        "YOUR_SEARCH1API_KEY_PLACEHOLDER"
      ],
      "env": {
        "DEBUG": "true"
      }
    }
  }
}
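Internally, the proxy has to turn each mcpServers entry into something it can launch. A sketch of that mapping, assuming the structure of the config above (the helper name `toSpawnSpecs` is illustrative; the real proxy connects these via the MCP SDK's stdio client transport):

```javascript
// Convert a parsed mcp-config.json object into launchable server specs.
// Illustrative helper, not part of the proxy's actual API: the real proxy
// would hand each spec to an @modelcontextprotocol/sdk stdio transport.
function toSpawnSpecs(config) {
  return Object.entries(config.mcpServers ?? {}).map(([name, server]) => ({
    name,
    command: server.command,
    args: server.args ?? [],
    env: { ...process.env, ...(server.env ?? {}) }, // merge over inherited env
  }));
}
```

Merging over process.env (rather than replacing it) keeps PATH and similar variables available to the spawned tool server while still applying per-server overrides like DEBUG.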
- Start the proxy
npm start
Or run directly with Node if you have the script named accordingly:
node llama-server-mcp-proxy.js
- (Optional) set environment variables to customize behavior (PORT, LLAMA_SERVER_URL, MCP_CONFIG_PATH, MCP_PROXY_DEBUG, LLAMA_SERVER_TIMEOUT).
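A sketch of how the proxy might read those variables with fallbacks. The variable names come from this README, and the defaults for PORT and LLAMA_SERVER_URL match the documented ones; the defaults for MCP_CONFIG_PATH and LLAMA_SERVER_TIMEOUT are assumptions for illustration:

```javascript
// Read proxy configuration from the environment with fallback defaults.
// PORT and LLAMA_SERVER_URL defaults match the README; the config path
// and timeout defaults are illustrative assumptions.
function loadConfig(env = process.env) {
  return {
    port: Number(env.PORT ?? 9090),
    llamaServerUrl: env.LLAMA_SERVER_URL ?? "http://localhost:8080",
    mcpConfigPath: env.MCP_CONFIG_PATH ?? "./mcp-config.json",
    debug: env.MCP_PROXY_DEBUG === "true",
    timeoutMs: Number(env.LLAMA_SERVER_TIMEOUT ?? 300000),
  };
}
```

With this shape, `PORT=7000 MCP_PROXY_DEBUG=true node llama-server-mcp-proxy.js` would run the proxy on port 7000 with verbose logging.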
Additional notes
- Keep your mcp-config.json secure because it may contain API keys. Add it to .gitignore.
- The proxy defaults to http://localhost:8080 for llama-server and port 9090 for the proxy; adjust with PORT and LLAMA_SERVER_URL.
- If a tool server requires authentication, supply appropriate API keys in mcp-config.json (as shown in the example).
- For debugging, enable MCP_PROXY_DEBUG to get verbose logs; watch for [Executing tool...] and [Tool Result: ...] messages to understand the flow.
- If you encounter connection errors to a tool server, ensure the tool server is listening on the port the proxy expects (passing --port 0, as in the example, lets the server pick a free port).
- You can add additional MCP tool servers in mcp-config.json as needed, each with its own command, arguments, and environment variables.
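For example, a config with a second tool server alongside search1api might look like the following (the "my-other-tool" entry is a hypothetical placeholder, not a real package):

```json
{
  "mcpServers": {
    "search1api": {
      "command": "npx",
      "args": ["search1api-mcp", "--port", "0", "--api-key", "YOUR_SEARCH1API_KEY_PLACEHOLDER"],
      "env": { "DEBUG": "true" }
    },
    "my-other-tool": {
      "command": "node",
      "args": ["./my-other-tool-server.js"],
      "env": {}
    }
  }
}
```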