Get the FREE Ultimate OpenClaw Setup Guide →

eval-business-logic

Scanned
npx machina-cli add skill Ibrahim-3d/conductor-orchestrator-superpowers/eval-business-logic --openclaw
Files (1)
SKILL.md
6.1 KB

Business Logic Evaluator Agent

Specialized evaluator for tracks that implement core product logic — generation pipelines, state machines, pricing, or other business-rule-heavy features.

When This Evaluator Is Used

Dispatched by loop-execution-evaluator when the track involves:

  • Core product pipeline logic
  • State machine or workflow systems
  • Pricing tier enforcement
  • Dependency resolution between deliverables
  • Download or packaging features

Inputs Required

  1. Track's spec.md and plan.md
  2. conductor/product.md — product rules (deliverables, tiers, dependencies)
  3. Project-specific pipeline/prompt configurations (if applicable)
  4. Data definition files (e.g., asset definitions, feature configs)
  5. Implementation code being evaluated

Evaluation Passes (6 checks)

Pass 1: Product Rules Compliance

Check against rules defined in conductor/product.md:

RuleWhat to Verify
DeliverablesAll defined deliverables are implemented and functional
DependenciesEach deliverable's dependencies are correctly enforced
Processing orderSequential processing respects dependency chain
Tier systemFree tier limitations enforced, paid tier unlocks correct features
PricingPricing model matches product spec (one-time, subscription, etc.)
State rulesState transitions (e.g., lock/unlock, draft/publish) propagate correctly
### Product Rules: PASS / FAIL
- Rules checked: [count]
- Violations: [list rule: actual behavior]
- Deliverables functional: [X]/[total]

Pass 2: Feature Correctness

For each feature in the spec, verify it works correctly:

CheckMethod
Happy pathPrimary user flow produces expected result
Input validationInvalid inputs rejected with clear messaging
Output correctnessGenerated data matches expected format/structure
State mutationsState changes are correct and complete
Side effectsDownstream effects trigger correctly (e.g., dependency propagation)
### Feature Correctness: PASS / FAIL
- Features tested: [count]
- Correct: [count]
- Failures: [describe each]

Pass 3: Edge Cases

ScenarioWhat to Verify
Empty stateFirst-time user with no data
Boundary valuesMax input length, empty inputs, special characters
Concurrent operationsWhat happens if user triggers 2 operations at once
Network failure mid-operationPartial state handled correctly
Re-processingRe-running an operation on existing data prompts confirmation if needed
All items locked/finalizedUI reflects that no further changes are possible
Tier limitsExceeding free tier limit shows upgrade prompt
### Edge Cases: PASS / FAIL
- Scenarios checked: [count]
- Unhandled: [list]
- User impact: [describe]

Pass 4: State Transitions

Verify state machine correctness for your project's state model. Example pattern:

StateValid Transitions
emptyprocessing (when user triggers action)
processingready (success) or error (failure)
readylocked (user finalizes) or processing (re-process)
lockedoutdated (dependency changed) or ready (unlock)
outdatedprocessing (user re-processes)
errorprocessing (retry)

Adapt the state table above to match your project's actual states.

### State Transitions: PASS / FAIL
- States implemented: [list]
- Invalid transitions possible: [list]
- Missing transitions: [list]

Pass 5: Data Flow

CheckWhat to Verify
Input → ProcessingUser form data correctly feeds into processing pipeline
Processing → OutputResults stored/displayed correctly
Output → PersistenceResults saved to store/database
Cross-componentData shared correctly between components
Stale dataNo stale renders after state changes
### Data Flow: PASS / FAIL
- Flow verified: [input → output]
- Stale data issues: [describe]
- Data loss points: [list]

Pass 6: User Journey Completeness

Walk through the complete user journey for the feature under evaluation. Example structure:

1. User provides input (form, selection, etc.)
2. System processes input
3. User reviews output
4. User can lock/finalize results
5. System handles dependencies between outputs
6. User views all deliverables
7. User can export/download results
8. User can re-process any unlocked item
9. Locked items show "outdated" if dependencies change

Adapt the journey steps above to match your project's actual user flow.

### User Journey: PASS / FAIL
- Steps completed: [X]/[total]
- Broken at step: [which]
- User experience: [smooth / friction at: describe]

Verdict Template

## Business Logic Evaluation Report

**Track**: [track-id]
**Evaluator**: eval-business-logic
**Date**: [YYYY-MM-DD]

### Results
| Pass | Status | Issues |
|------|--------|--------|
| 1. Product Rules | PASS/FAIL | [details] |
| 2. Feature Correctness | PASS/FAIL | [details] |
| 3. Edge Cases | PASS/FAIL | [details] |
| 4. State Transitions | PASS/FAIL | [details] |
| 5. Data Flow | PASS/FAIL | [details] |
| 6. User Journey | PASS/FAIL | [details] |

### Verdict: PASS / FAIL
[If FAIL, list specific fix actions for loop-fixer]

Handoff

  • PASS → Return to loop-execution-evaluator → Conductor marks complete
  • FAIL → Return to loop-execution-evaluator → Conductor dispatches loop-fixer

Source

git clone https://github.com/Ibrahim-3d/conductor-orchestrator-superpowers/blob/master/skills/eval-business-logic/SKILL.mdView on GitHub

Overview

The eval-business-logic skill acts as a specialized evaluator for tracks that implement core product logic—pipelines, state machines, pricing, and packaging. It checks feature correctness against product rules, tests edge cases and data flow, and validates user-journey completeness. It is dispatched by loop-execution-evaluator for track types 'business-logic', 'generator', or 'core-feature' and is triggered by evaluate logic, test business rules, verify business rules, or check feature.

How This Skill Works

It ingests spec.md, plan.md, conductor/product.md (product rules), plus data definitions and the implementation code being evaluated. It then performs multiple evaluation passes focused on product rules compliance, feature correctness, edge cases, and state transitions, and returns a structured report with detailed pass/fail statuses and actionable findings.

When to Use It

  • Evaluate core product pipeline logic and data flow
  • Validate state machine/workflow behavior and transitions
  • Enforce pricing tiers and feature gating
  • Verify dependencies between deliverables
  • Test packaging or download features for correct artifacts

Quick Start

  1. Step 1: Gather track inputs (spec.md, plan.md, product rules, data definitions, and code)
  2. Step 2: Run the eval-business-logic agent via the Evaluate-Loop
  3. Step 3: Review the PASS/FAIL report, fix violations, and re-run

Best Practices

  • Align checks to rules in conductor/product.md and project specs
  • Explicitly verify dependencies for each deliverable
  • Test happy path and invalid inputs with clear messaging
  • Validate state transitions propagate across components
  • Review downstream effects and data flow (side effects) for cascade correctness

Example Use Cases

  • Confirm a deployment pipeline enforces dependencies and ordering
  • Verify state transitions from draft to publish and lock when finalized
  • Ensure pricing tier unlocks match product spec
  • Validate packaging feature creates correct artifact names and metadata
  • Test an operation under network interruption to ensure partial state handling

Frequently Asked Questions

Add this skill to your agents
Sponsor this space

Reach thousands of developers