What triggers eval-code-quality?

It is dispatched by the loop-execution-evaluator when the track type is feature, refactor, or infrastructure and is triggered by evaluate code, code review, quality check, or build check.

What inputs are required to run it?

Track spec.md, plan.md, changed files from plan tasks or git diff, tsconfig.json, package.json, and any existing tests.

eval-code-quality

Scanned

npx machina-cli add skill Ibrahim-3d/conductor-orchestrator-superpowers/eval-code-quality --openclaw

Files (1)

SKILL.md

5.9 KB

Code Quality Evaluator Agent

Specialized evaluator for tracks whose deliverables are functional code — features, state management, utilities, API routes.

When This Evaluator Is Used

Dispatched by loop-execution-evaluator when the track is one of:

Feature implementation (e.g., user authentication, data processing)
Infrastructure/utility work
Refactoring tracks
State management (Zustand, hooks)

Inputs Required

Track's spec.md and plan.md
Changed files (from plan.md task summaries or git diff)
tsconfig.json — TypeScript config
package.json — dependencies and scripts
Existing test files (if any)

Evaluation Passes (6 checks)

Pass 1: Build Integrity

npm run build    # Must exit 0
npx tsc --noEmit # Must exit 0 (no type errors)

### Build: PASS ✅ / FAIL ❌
- Build status: [success / X errors]
- Type check: [clean / X type errors]
- Errors: [list if any]

Pass 2: Type Safety

Check	What to Look For
No `any` types	Explicit typing on all exports, function params, return types
Generic usage	API responses typed with `ApiResponse<T>`
Null safety	Optional chaining (`?.`) or null checks where data may be absent
Type exports	Shared types in `src/types/`, not inline
Interface consistency	Types match spec/product.md schema

### Type Safety: PASS ✅ / FAIL ❌
- `any` usage: [count] — [list files:lines]
- Missing types: [list untyped exports]
- Null safety issues: [list]

Pass 3: Code Patterns & State Management

Check	What to Look For
File structure	Files in correct directories per component architecture
Naming	kebab-case files, PascalCase components, `{Component}Props`
Imports	No circular imports, no unused imports
DRY	No significant code duplication (>10 lines repeated)
Single responsibility	Functions/components do one thing
Module boundaries	Feature code in feature dirs, shared code in `ui/` or `lib/`
State sync	Every client state mutation has corresponding API endpoint
Optimistic updates	Rollback logic present on API failure
Source of truth	Server (DB) is source of truth, client is cache

State Sync Anti-Patterns to Flag:

// ❌ BAD: State updated without API persistence
const toggleLock = (id) => {
  set({ assets: { ...assets, [id]: { locked: true } } });
  // No API call!
}

// ✅ GOOD: Optimistic update with API sync
const toggleLock = async (id) => {
  const prev = assets;
  set({ assets: { ...assets, [id]: { locked: true } } }); // Optimistic
  try {
    await fetch(`/api/assets/${id}`, {
      method: 'PATCH',
      body: JSON.stringify({ locked: true })
    });
  } catch (err) {
    set({ assets: prev }); // Rollback
    throw err;
  }
}

### Code Patterns & State Sync: PASS ✅ / FAIL ❌
- Naming violations: [list]
- Unused imports: [list files]
- Duplication found: [describe]
- **State mutations without API: [count] — [list]**
- **Missing rollback logic: [count] — [list]**
- **API endpoints without client updates: [count] — [list]**

Pass 4: Error Handling

Check	What to Look For
API calls	try/catch or error handling on all async operations
User feedback	Toast/inline error shown to user on failure
Null data	Empty states handled (no data, loading, error)
Edge cases	Invalid input, network failure, timeout
No silent failures	Errors not swallowed without user notification

### Error Handling: PASS ✅ / FAIL ❌
- Unhandled async: [list functions]
- Missing user feedback: [list scenarios]
- Silent failures: [list]

Pass 5: Dead Code & Cleanup

Check	What to Look For
Unused exports	Functions/components exported but never imported
Commented code	Large blocks of commented-out code (should be deleted)
Unused files	Files that exist but aren't imported anywhere
TODO/FIXME	Unresolved TODO comments
Console logs	`console.log` left in production code

### Dead Code: PASS ✅ / FAIL ❌
- Unused exports: [list]
- Console logs: [list files:lines]
- TODOs: [list]

Pass 6: Test Coverage (when applicable)

Check	Target
Overall coverage	70%
Business logic	90%
API routes	80%
Utility functions	80%

### Tests: PASS ✅ / FAIL ❌ / ⚠️ NOT CONFIGURED
- Coverage: [X]% overall
- Business logic: [X]%
- Untested critical paths: [list]

Verdict Template

## Code Quality Evaluation Report

**Track**: [track-id]
**Evaluator**: eval-code-quality
**Date**: [YYYY-MM-DD]
**Files Evaluated**: [count]

### Results
| Pass | Status | Issues |
|------|--------|--------|
| 1. Build | PASS/FAIL | [details] |
| 2. Type Safety | PASS/FAIL | [count] issues |
| 3. Code Patterns | PASS/FAIL | [count] issues |
| 4. Error Handling | PASS/FAIL | [count] issues |
| 5. Dead Code | PASS/FAIL | [count] issues |
| 6. Tests | PASS/FAIL/N/A | [coverage] |

### Verdict: PASS ✅ / FAIL ❌
[If FAIL, list specific fix actions for loop-fixer]

Handoff

PASS → Return to loop-execution-evaluator → Conductor marks complete
FAIL → Return to loop-execution-evaluator → Conductor dispatches loop-fixer

Source

git clone https://github.com/Ibrahim-3d/conductor-orchestrator-superpowers/blob/master/skills/eval-code-quality/SKILL.mdView on GitHub

Overview

Specialized evaluator for functional-code tracks where the deliverable is code—features, API routes, state management, and utilities. It checks build integrity, type safety, patterns, error handling, dead code, imports, test coverage, and naming conventions. Dispatched by loop-execution-evaluator for track types: feature, refactor, or infrastructure, and triggered by evaluate code, code review, quality check, or build check.

How This Skill Works

Inputs required include track spec.md and plan.md, changed files, tsconfig.json, package.json, and any existing tests. It runs through six evaluation passes: Build Integrity, Type Safety, Code Patterns & State Management, Error Handling, Import hygiene and dead code checks, and Test Coverage. The evaluator reports pass/fail results with concrete findings to guide improvements.

When to Use It

When implementing a feature delivery such as a new API route or UI state logic
During code reviews to verify quality gates and adherence to patterns
Before a build to catch issues early and prevent regressions
For infrastructure or utility work that delivers code
During refactors or state-management updates to ensure safety and consistency

Quick Start

Step 1: Gather inputs — collect track spec.md, plan.md, changed files, tsconfig.json, package.json, and existing tests
Step 2: Run build and type checks — npm run build and npx tsc --noEmit
Step 3: Review the evaluation report, fix issues, and re-run as needed

Best Practices

Run the build and type-check steps as a gate; both must pass with zero errors
Enforce explicit typing and avoid any; use ApiResponse<T> for API data when applicable
Maintain a clean file structure and follow naming conventions; avoid circular or unused imports
Ensure state mutations have corresponding API persistence or rollback logic
Verify test coverage and meaningful tests exist for new changes

Example Use Cases

Evaluate a new user authentication API route with proper types and tests
Review Zustand store changes ensuring state sync with API and rollback on failure
Audit a data-processing utility for imports, naming, and dead code
Audit an infrastructure utility integrating a logging service with tests
Validate a feature refactor that reorganizes modules into shared ui and lib boundaries

Frequently Asked Questions

Add this skill to your agents