Get the FREE Ultimate OpenClaw Setup Guide →

Goodeye-Labs/truesight-mcp-skills Skills

(8)

Browse AI agent skills from Goodeye-Labs/truesight-mcp-skills for Claude Code, OpenClaw, Cursor, Windsurf, and more. Install them with a single command to extend what your agents can do.

bootstrap-template-evaluation

Goodeye-Labs/truesight-mcp-skills

2

Fastest route to a deployed live evaluation using a pre-built Truesight template. Use when the user wants a quick start without building judgment configs from scratch.

build-review-interface

Goodeye-Labs/truesight-mcp-skills

2

Build a custom web interface for trace annotation and review. Use when users need a bespoke review surface for their workflow.

create-evaluation

Goodeye-Labs/truesight-mcp-skills

2

Scope what quality should be measured, convert it into one or more actionable binary evaluations, deploy those evaluations through Truesight MCP, and generate a companion skill that applies them correctly. Use when a user wants to create new evals, quality checks, guardrails, or pass/fail criteria for AI outputs.

error-analysis

Goodeye-Labs/truesight-mcp-skills

2

Systematically identify and categorize failure modes in evaluated traces using Truesight datasets and error-analysis tools. Use when quality issues are unclear, after major pipeline changes, or when incidents indicate drift.

eval-audit

Goodeye-Labs/truesight-mcp-skills

2

Audit an existing evaluation workflow and produce severity-ranked findings with concrete next actions. Use when inheriting an eval setup, diagnosing quality regressions, or checking LLM evaluation process maturity.

evaluate-trace

Goodeye-Labs/truesight-mcp-skills

2

Evaluate one or more traces against an existing Truesight live evaluation. Use when a deployed live evaluation already exists and the user wants run outputs with optional handoff to review and promotion.

review-and-promote-traces

Goodeye-Labs/truesight-mcp-skills

2

Judge flagged trace outputs and promote judged items back to datasets. Use when an evaluation run requires human judgment or when review queue items need to be judged for promotion into the dataset.

truesight-workflows

Goodeye-Labs/truesight-mcp-skills

2

Orchestrator for Truesight MCP skills. Use this when the user needs help choosing the right Truesight workflow or when intent is ambiguous across LLM evaluate, error analysis, review, templates, or evaluation creation.

Sponsor this space

Reach thousands of developers