Get the FREE Ultimate OpenClaw Setup Guide →

eval-ui-ux

Scanned
npx machina-cli add skill Ibrahim-3d/conductor-orchestrator-superpowers/eval-ui-ux --openclaw
Files (1)
SKILL.md
9.3 KB

UI/UX Evaluator Agent

Specialized evaluator for tracks whose deliverables are visual UI — screens, components, design systems, layouts.

When This Evaluator Is Used

Dispatched by loop-execution-evaluator when the track is one of:

  • Screen implementation
  • Design system work
  • Component library
  • UI polish or UX audit

Inputs Required

  1. Track's spec.md — what was supposed to be built
  2. Track's plan.md — tasks that should be complete
  3. Design system reference — your project's global CSS or token file (e.g., src/app/globals.css)
  4. Components to evaluate — all files in src/components/ and src/app/
  5. Data files — content JSON files used for copy (if applicable)

Evaluation Passes (8 checks)

Pass 1: Design System Adherence

Read your project's CSS/token file to extract the token system, then check components:

CheckWhat to Look For
ColorsCSS custom properties (--color-*, --brand-*) used, no raw hex/rgb in components
SpacingTailwind spacing classes follow consistent grid, no arbitrary px values
TypographyFont families from your design system fonts, sizes from scale
RadiusUses token-defined radius values, no random rounded-* overrides
ShadowsShadow classes from token system, consistent elevation levels
Glass-morphismBackdrop-blur, bg-opacity patterns on cards/modals/overlays (if applicable)
### Design System Adherence: PASS / FAIL
- Hardcoded colors found: [count] — [list files:lines]
- Hardcoded spacing found: [count] — [list files:lines]
- Typography violations: [count] — [list]
- Token coverage: [X]% of visual properties use design tokens

Pass 2: Visual Consistency

Compare styling patterns across screens:

CheckWhat to Look For
Spacing rhythmSame gap/padding patterns across sections
Color usageBrand palette applied consistently (not random grays)
Component stylingSame component (Card, Button) looks identical on all pages
Icon sizingIcons use consistent size props
Page structureSimilar content types have similar visual treatment
### Visual Consistency: PASS / FAIL
- Inconsistencies found: [count]
- Affected screens: [list]
- Specific issues: [describe each]

Pass 3: Layout & Structure

CheckWhat to Look For
Header presenceHeader component rendered on every page (or layout group)
Footer presenceFooter component rendered on every page
Container usageMax-width Container wraps content on all pages
Section usageVertical spacing via Section component
Visual hierarchyh1 → h2 → body → actions ordering clear
Content widthNo full-bleed text blocks (constrained width)
### Layout & Structure: PASS / FAIL
- Pages missing Header: [list]
- Pages missing Footer: [list]
- Pages missing Container: [list]
- Hierarchy issues: [describe]

Pass 4: Responsive Behavior

Check component classes and layout patterns:

BreakpointWhat to Check
375px (mobile)Single column, stacked layout, touch-friendly
768px (tablet)2-column grids, adjusted spacing
1024px+ (desktop)Full layout, 3-4 column grids
CheckWhat to Look For
Grid collapsegrid-cols-1 md:grid-cols-2 lg:grid-cols-3 patterns
Horizontal scrollNo overflow-x issues, no fixed-width elements
Touch targetsButtons/links >= 44px on mobile (min-h-11, p-3, etc.)
Mobile menuHeader collapses to hamburger/sheet on mobile
Text truncationLong text doesn't break layout
### Responsive: PASS / FAIL
- Breakpoints covered: [375/768/1024]
- Pages with issues: [list]
- Touch target violations: [list components]

Pass 5: Component States & Conditional Rendering

Check interactive components for complete state coverage and safe rendering logic:

Component TypeRequired States
Buttondefault, hover, active, focus, disabled, loading
Inputdefault, focus, error, disabled, placeholder
Carddefault, hover (if interactive)
Modalopen/close animation, backdrop, focus trap, escape-to-close
Toastsuccess, error, info variants
Loadingspinner or skeleton for every async operation

Conditional Rendering Patterns to Check:

Anti-PatternWhat to FlagFix
Magic string comparisonstatus === 'ready' && <Image />Use explicit arrays: ['ready', 'locked'].includes(status)
Non-exhaustive switchMissing default caseAdd TypeScript exhaustiveness check
Mixed visual/data statestatus = locked ? 'locked' : 'ready'Separate: const isLocked = item.locked; const hasContent = !!imageUrl;
Status explosion7+ status codesSplit into orthogonal states

Example Brittle Pattern:

// BAD: Image disappears when status changes
{imageUrl && status === 'ready' && <Image />}
// When item is locked, status becomes 'locked' → image hidden!

// GOOD: Explicit list of statuses that show images
const STATUSES_WITH_IMAGES = ['ready', 'locked', 'outdated'];
{imageUrl && STATUSES_WITH_IMAGES.includes(status) && <Image />}

// BETTER: Separate concerns
const shouldShowImage = imageUrl && !['generating', 'error'].includes(status);
{shouldShowImage && <Image />}
### Component States & Rendering: PASS / FAIL
- Missing states: [component: missing state]
- Components audited: [count]
- Coverage: [X]% have all required states
- **Brittle conditionals found: [count] — [list files:lines]**
- **Non-exhaustive switches: [count] — [list]**
- **Status explosion (>7 codes): [list components]**

Pass 6: Animation & Transitions

CheckWhat to Look For
Page transitionsPage transition wrapper or framer-motion layout
Hover effectsSubtle scale/shadow/opacity changes on interactive elements
Loading animationsSpinner/skeleton with smooth animation
Modal transitionsFade/scale on open/close
State transitionsNo sudden jumps between states
### Animations: PASS / FAIL
- Pages missing transitions: [list]
- Components missing hover effects: [list]
- Jarring state changes: [describe]

Pass 7: Accessibility Baseline

CheckWCAG LevelWhat to Look For
LabelsAAll <input> elements have associated <label>
Button textAAll buttons have visible text or aria-label
Alt textAAll <img> elements have meaningful alt
Color contrastAAText/background contrast >= 4.5:1 (body), >= 3:1 (large)
Focus visibleAAFocus ring visible on all interactive elements
Focus orderATab order follows visual reading order
Modal focusAModal traps focus when open
### Accessibility: PASS / FAIL
- Missing labels: [list inputs]
- Missing alt text: [list images]
- Contrast issues: [list]
- Focus order issues: [describe]

Pass 8: Usability Check (Copy Quality)

Read all user-facing text in components and data files:

CheckWhat to Look For
HeadingsSimple, friendly, no jargon
CTAsAction verbs that clearly describe the outcome
ErrorsTell user what to do, not technical details
LabelsEveryday language (e.g., "Your name", not "Name identifier")
JargonNo: "auth", "render", "deps", "schema", "API", "submit"
ToneReassuring, not pushy (especially paywall, error states)

Verify against your target personas — would they understand every piece of text without explanation?

### Usability Check: PASS / FAIL
- Jargon found: [word: file:line]
- Unfriendly copy: [text: file:line]
- Suggested rewrites: [original → suggested]

Verdict Template

## UI/UX Evaluation Report

**Track**: [track-id]
**Evaluator**: eval-ui-ux
**Date**: [YYYY-MM-DD]
**Screens Evaluated**: [count]

### Results
| Pass | Status | Issues |
|------|--------|--------|
| 1. Design System | PASS/FAIL | [count] issues |
| 2. Visual Consistency | PASS/FAIL | [count] issues |
| 3. Layout & Structure | PASS/FAIL | [count] issues |
| 4. Responsive | PASS/FAIL | [count] issues |
| 5. Component States | PASS/FAIL | [count] issues |
| 6. Animations | PASS/FAIL | [count] issues |
| 7. Accessibility | PASS/FAIL | [count] issues |
| 8. Usability Check | PASS/FAIL | [count] issues |

### Verdict: PASS / FAIL
[If FAIL, list specific fix actions for loop-fixer]

Handoff

  • PASS → Return to loop-execution-evaluator → Conductor marks complete
  • FAIL → Return to loop-execution-evaluator → Conductor dispatches loop-fixer

Source

git clone https://github.com/Ibrahim-3d/conductor-orchestrator-superpowers/blob/master/skills/eval-ui-ux/SKILL.mdView on GitHub

Overview

It is a specialized evaluator for tracks whose deliverables are visual UI screens components or design systems. It validates design system adherence visual consistency layout structure responsive behavior component states animations accessibility baseline and copy usability. It is dispatched by loop-execution-evaluator when a track type is ui design-system or screens and triggered by evaluate UI UI review design review or visual audit.

How This Skill Works

The evaluator consumes inputs such as spec.md plan.md a design system token file and all components and data used for copy. It then runs eight evaluation passes covering design system adherence visual consistency layout and structure responsive behavior component states animations accessibility baseline and copy usability. It checks design token usage by reading the token file for colors spacing typography radius and shadows and flags hardcoded values or gaps. The results are produced as a structured report with clear PASS or FAIL outcomes and file references.

When to Use It

  • Screen implementation
  • Design system work
  • Component library
  • UI polish
  • UX audit

Quick Start

  1. Step 1: Gather spec.md plan.md design system token file components and data files
  2. Step 2: Run the eval-ui-ux evaluation across the eight passes and collect findings
  3. Step 3: Produce a report with PASS/FAIL results and annotated fixes

Best Practices

  • Start from the design system token file and rely on tokens rather than hardcoded values
  • Document token coverage and surface any violations with precise file and line references
  • Compare visuals across screens to ensure consistent spacing color usage and typography
  • Validate responsive behavior at mobile tablet and desktop breakpoints
  • Record accessibility baseline and copy quality checks and note any aria or semantic issues

Example Use Cases

  • Audit a new dashboard screen against the design system
  • Review a UI component library for parity across pages
  • Verify responsive reflow from mobile to desktop
  • Check animations and transitions for token aligned motion
  • Assess copy clarity and accessibility baseline across screens

Frequently Asked Questions

Add this skill to your agents
Sponsor this space

Reach thousands of developers