mcp -evaluations-skills

Evaluate MCP servers for quality and reliability. Verify tool functionality, test error handling, generate tests, and assess response quality with no dependencies other than curl. Use this when validating MCP server implementations, testing OpenAPI-to-MCP conversions, or assessing API tool quality.

Installation

Run this command in your terminal to add the MCP server to Claude Code.

Run in terminal:

View docs

Command

claude mcp add --transport stdio mcp-com-ai-mcp-server-evaluations-skills node server.js \
  --env PORT="3030 (default) or set to preferred port" \
  --env LOG_LEVEL="info|debug (optional)"

How to use

This MCP server provides an automated evaluation workflow for MCP servers, focusing on tool discovery, health checks, functional testing, and error handling. It exposes an API-based interface that you can exercise with curl (and jq for JSON parsing) to verify health, list available tools, call tools with valid arguments, and deliberately trigger errors to assess resilience. To use it, first ensure the evaluator server is running and reachable at its configured host and port (default http://localhost:3030). The quickstart workflow demonstrates how to check health, list tools, invoke a tool, and generate an error to test the quality checks. You can script these calls to automate repeated evaluations as part of a CI workflow and use the evaluation rubric to gauge performance over time.

How to install

Prerequisites:

curl
jq
Optional: bun or node for local automation

Install steps:

Clone the MCP Evaluations Skill repository git clone https://github.com/your-org/mcp-evaluations-skills.git cd mcp-evaluations-skills
Install dependencies
- If a package.json is present (Node.js): npm install
- If using Bun (optional): bun install
Run the MCP Evaluations Skill server
- If a start script is defined: npm run start
- Or run directly with Node: node server.js
Verify the server is running (default port 3030) curl -s http://localhost:3030/health

Note: If you configure a different port, ensure you update your requests accordingly.

Additional notes

Tips and considerations:

The evaluator expects health at /health and tools discovery/calling via /mcp endpoint. If you customize ports or paths, keep consistency across tests.
Use curl + jq for quick validation of responses; the README examples demonstrate typical payloads for tools/list and tools/call.
Environment variables such as PORT and LOG_LEVEL can help with logging and running in containers or CI environments.
If your server uses a different start command or a Docker image, adjust the mcp_config accordingly and ensure the port is exposed.
Common issues: server not reachable (check network/port), invalid tool name or arguments (ensure correct input shapes), and JSON-RPC formatting errors.
For CI automation, you can script the health check, tools/list, and a sample tools/call sequence to automatically verify tool availability and basic correctness.

Related MCP Servers

any-chat-completions

148

MCP Server for using any LLM as a Tool

time

⏰ Time MCP Server: Giving LLMs Time Awareness Capabilities

skrills

Coordinate skills between Codex, Copilot, and Claude Code. Validates, analyzes, and syncs skills, subagents, commands, and configuration between multiple CLIs.

unity-editor

An MCP server and client for LLMs to interact with Unity Projects

website-publisher

AI Website builder and publisher MCP. Quickly publish and deploy your AI generated code as real website URL. Support html, css, js, python etc.

xgmem

Global Memory MCP server, that manage all projects data.