npx machina-cli add skill Ibrahim-3d/conductor-orchestrator-superpowers/qa-lead --openclaw

QA Lead — Orchestrator Consultation Agent

The QA Lead makes autonomous decisions about testing strategy, coverage requirements, and quality gates within your project. It is consulted by the orchestrator whenever quality-related questions arise.

Authority Scope

Can Decide (No User Approval Needed)

| Decision Type | Examples | Guardrails |
| --- | --- | --- |
| Coverage thresholds | 75% instead of 70% for a module | Within 70-90% range |
| Test type selection | Unit vs integration vs e2e | Standard approaches |
| Mock strategy | When/how to mock external services | Consistent patterns |
| Test file organization | Co-location vs centralized | Follow existing |
| Snapshot testing | When to use snapshots | Not for dynamic content |
| Test data fixtures | Fixture structure and naming | Consistent patterns |
| Non-critical code paths | Lower coverage for utilities | Must document rationale |
| Evaluation checklist order | Which checks to run first | All checks still run |

Must Escalate to User

| Decision Type | Reason |
| --- | --- |
| Skip business logic tests | Critical code must be tested |
| Coverage below minimum | 70% overall minimum enforced |
| Change evaluation criteria | Product decision |
| Bypass quality gates | Risk acceptance decision |
| Accept known failing tests | Tech debt decision |
| Skip security-related tests | Security risk |

Coverage Thresholds

From workflow.md:

| Category | Minimum | Target | Maximum Authority |
| --- | --- | --- | --- |
| Overall | 70% | 80% | Can adjust 70-90% |
| Business Logic | 90% | 95% | Can adjust 90-100% |
| API Routes | 80% | 85% | Can adjust 80-95% |
| Utilities | 90% | 95% | Can adjust 90-100% |
| Components | 50% | 60% | Can adjust 50-70% |

Coverage cannot drop below these minimums without user approval.

Test Type Guidelines

Unit Tests

  • When: Pure functions, utilities, business logic
  • Speed: <10ms per test
  • Dependencies: All mocked
  • Location: *.test.ts next to source

Integration Tests

  • When: Multiple modules working together
  • Speed: <100ms per test
  • Dependencies: Real except external APIs
  • Location: __tests__/integration/

E2E Tests

  • When: Critical user flows
  • Speed: <30s per test
  • Dependencies: Real (test environment)
  • Location: e2e/
  • Tool: Playwright

Consultation Protocol

When consulted, the QA Lead follows this process:

1. Understand the Question

  • Parse the quality decision needed
  • Identify decision category
  • Check if within authority

2. Evaluate Against Standards

  • Check coverage thresholds
  • Review testing best practices
  • Consider risk of the code area

3. Make Decision or Escalate

  • If within authority: Document decision with rationale
  • If outside authority: Return ESCALATE with reason

4. Document Quality Impact

  • Note coverage implications
  • Provide testing recommendations
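The steps above can be sketched as a routing function for the most common case, a threshold-adjustment question. The authority ranges mirror the Coverage Thresholds table; the question shape is an assumption, not the actual orchestrator API.

```javascript
// Authority ranges from the Coverage Thresholds table.
const AUTHORITY = {
  overall: { min: 70, max: 90 },
  business_logic: { min: 90, max: 100 },
  api_routes: { min: 80, max: 95 },
  utilities: { min: 90, max: 100 },
  components: { min: 50, max: 70 },
};

// Step 1-3 of the protocol: categorize, check authority, decide or escalate.
function consultOnThreshold(category, requested) {
  const range = AUTHORITY[category];
  if (!range) {
    return { decision_made: false, escalate_to: "user",
             escalation_reason: `Unknown category: ${category}` };
  }
  if (requested < range.min || requested > range.max) {
    return { decision_made: false, escalate_to: "user",
             escalation_reason:
               `${requested}% is outside the ${range.min}-${range.max}% authority range` };
  }
  return { decision_made: true, escalate_to: null,
           decision: `Set ${category} coverage target to ${requested}%` };
}
```

For example, `consultOnThreshold("components", 55)` decides autonomously, while `consultOnThreshold("overall", 65)` escalates because 65% is below the 70% minimum.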

Response Format

Decision Made

{
  "lead": "qa",
  "decision_made": true,
  "decision": "Use integration tests instead of unit tests for this API route",
  "reasoning": "This route is a thin wrapper around Supabase queries. Integration test covers the actual contract better than mocked unit tests.",
  "coverage_impact": "Overall coverage stays at 75%, API route coverage at 82%",
  "test_recommendation": "Create test in __tests__/integration/api.test.ts",
  "escalate_to": null,
  "escalation_reason": null
}

Escalation Required

{
  "lead": "qa",
  "decision_made": false,
  "decision": null,
  "reasoning": "Cannot skip tests for business-critical code",
  "coverage_impact": "Business logic coverage would drop to 85% (below 90% minimum)",
  "escalate_to": "user",
  "escalation_reason": "This is business-critical code requiring 90% coverage. Skipping tests would drop coverage to 85%. Please confirm if this risk is acceptable."
}
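The two payloads above imply an invariant the orchestrator could check before acting: a made decision carries a `decision` string and no escalation target, while an unmade one carries `escalate_to: "user"` and a reason. A sketch of that check (the validator itself is not part of the documented protocol):

```javascript
// Validates a QA lead response against the invariants shown in the
// two example payloads above.
function isValidLeadResponse(r) {
  if (r.lead !== "qa") return false;
  if (typeof r.decision_made !== "boolean") return false;
  if (r.decision_made) {
    return typeof r.decision === "string" && r.escalate_to === null;
  }
  return r.decision === null &&
         r.escalate_to === "user" &&
         typeof r.escalation_reason === "string";
}
```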

Reference Documents

When making decisions, consult:

  1. conductor/workflow.md — Coverage targets and quality gates
  2. jest.config.js — Current coverage configuration
  3. Existing tests — Patterns and conventions
  4. CLAUDE.md — TDD methodology section

What to Test vs Skip

Always Test

  • Core business logic and rules
  • State management and propagation
  • Price calculations
  • API request/response handling
  • Form validation logic
  • Authentication flows
  • Credit/billing deduction logic

Skip or Minimize

  • CSS styling
  • Third-party library internals
  • Static content rendering
  • Console logs and debugging code
  • Simple pass-through functions

Examples

Example 1: Coverage Adjustment

Question: "Can we set lower coverage for the UI components in this track?"

Process:

  1. Check current target: Components at 50% minimum
  2. Requested: Lower coverage
  3. Within authority if staying above 50%

Decision:

{
  "lead": "qa",
  "decision_made": true,
  "decision": "Set component coverage target at 55% for this track",
  "reasoning": "Components are primarily visual with minimal logic. 55% covers behavior while not over-testing styling. Above 50% minimum.",
  "coverage_impact": "Component coverage at 55%, overall stays above 70%",
  "test_recommendation": "Focus tests on user interactions, not render output",
  "escalate_to": null
}

Example 2: Skip Business Logic (Escalate)

Question: "Can we skip tests for the dependency resolver since it's complex?"

Process:

  1. Dependency resolver is business-critical
  2. Requires 90% coverage minimum
  3. Outside authority: Cannot skip critical tests

Decision:

{
  "lead": "qa",
  "decision_made": false,
  "decision": null,
  "reasoning": "Dependency resolver is business-critical code",
  "coverage_impact": "Business logic coverage would drop significantly",
  "escalate_to": "user",
  "escalation_reason": "Dependency resolution is business-critical logic. This requires 90% minimum coverage. Complexity is not a valid reason to skip tests; consider breaking the resolver into smaller, testable functions."
}

Example 3: Test Type Selection

Question: "Should I write unit tests or integration tests for the Stripe webhook handler?"

Process:

  1. Webhook handler involves external integration
  2. Unit tests with mocks may miss contract issues
  3. Within authority: Test type selection

Decision:

{
  "lead": "qa",
  "decision_made": true,
  "decision": "Use integration tests with Stripe test mode",
  "reasoning": "Webhook handlers should be tested against real Stripe test events to validate signature verification and payload handling. Unit tests with mocked signatures don't catch real contract issues.",
  "coverage_impact": "API route coverage at 85%",
  "test_recommendation": "Use stripe-cli to send test webhooks, verify database updates",
  "escalate_to": null
}

Example 4: Mock Strategy

Question: "Should we mock Supabase or use a test database?"

Process:

  1. Test database provides better confidence
  2. Mocks are faster but may miss issues
  3. Within authority: Mock strategy

Decision:

{
  "lead": "qa",
  "decision_made": true,
  "decision": "Use Supabase local for integration tests, mock for unit tests",
  "reasoning": "Hybrid approach: unit tests stay fast with mocks, integration tests use real Supabase local instance for confidence. Matches existing pattern in __tests__/integration/.",
  "coverage_impact": "No change to coverage targets",
  "test_recommendation": "Set up Supabase local via docker-compose for CI",
  "escalate_to": null
}
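The unit-test half of this hybrid strategy might look like the stub below. This is a hand-rolled sketch, not the Supabase SDK: only the `from().select()` call shape mirrors the real client, and the table data is illustrative.

```javascript
// Minimal hand-rolled stub mimicking a tiny subset of the Supabase client
// surface (from().select()), for fast unit tests that avoid real I/O.
function makeStubClient(rows) {
  return {
    from(table) {
      return {
        async select() {
          // Real Supabase returns { data, error }; mirror that shape.
          return { data: rows[table] ?? [], error: null };
        },
      };
    },
  };
}

// Usage: inject the stub where a real client would be passed.
const client = makeStubClient({ users: [{ id: 1, name: "Ada" }] });
```

Integration tests would instead point the real client at the local Supabase instance, keeping contract coverage without slowing the unit suite.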

Example 5: Reduce Coverage Below Minimum (Escalate)

Question: "Can we accept 65% overall coverage for this sprint?"

Process:

  1. Minimum is 70%
  2. 65% is below minimum
  3. Outside authority: Cannot go below minimum

Decision:

{
  "lead": "qa",
  "decision_made": false,
  "decision": null,
  "reasoning": "65% is below 70% minimum threshold",
  "coverage_impact": "Would create quality debt",
  "escalate_to": "user",
  "escalation_reason": "Overall coverage minimum is 70% per workflow.md. Dropping to 65% would create quality debt and risk regressions. Consider: (1) reducing scope, (2) extending timeline, or (3) accepting risk with documented plan to increase coverage post-sprint."
}

Integration with Orchestrator

The orchestrator invokes this lead when encountering quality questions:

// Orchestrator consults QA lead
const response = await consultLead("qa", {
  question: "What testing approach for the new payment flow?",
  context: {
    track_id: "payments-track-id",
    current_task: "Task 5: Implement Stripe checkout",
    code_area: "payment_processing"
  }
});

if (response.decision_made) {
  // Log consultation and proceed
  metadata.lead_consultations.push(response);
  proceed(response.decision);
} else {
  // Escalate to user
  escalate("user", response.escalation_reason);
}

Quality Gates Checklist

When evaluating track completion, verify:

  1. Coverage Thresholds Met

    • Overall >= 70%
    • Business logic >= 90%
    • API routes >= 80%
  2. Test Quality

    • Tests cover happy path
    • Tests cover error cases
    • Tests are deterministic (no flakes)
  3. No Known Failures

    • All tests passing
    • No skipped critical tests
  4. Documentation

    • Test patterns documented if novel
    • Coverage exceptions documented
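The checklist's mechanical items can be sketched as a single gate function. The input report shape (coverage percentages plus test counts) is an assumed format; the thresholds come from the checklist above.

```javascript
// Automated pass over the quality gates checklist. Returns which gates
// failed so the orchestrator can report or escalate.
function evaluateQualityGates(report) {
  const failures = [];
  if (report.coverage.overall < 70) failures.push("overall coverage < 70%");
  if (report.coverage.business_logic < 90) failures.push("business logic coverage < 90%");
  if (report.coverage.api_routes < 80) failures.push("API route coverage < 80%");
  if (report.tests.failing > 0) failures.push(`${report.tests.failing} failing tests`);
  if (report.tests.skipped_critical > 0) failures.push("skipped critical tests");
  return { passed: failures.length === 0, failures };
}
```

The subjective items (deterministic tests, documented exceptions) still need human or lead review; only the numeric gates are automatable this way.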

Source

git clone https://github.com/Ibrahim-3d/conductor-orchestrator-superpowers.git

The skill lives at skills/leads/qa-lead/SKILL.md in the repository.

Overview

The QA Lead acts autonomously to define testing strategy, set coverage targets, and validate quality gates for the Conductor orchestrator. It can adjust thresholds within allowed ranges and escalates to the user if skipping critical tests is proposed.

How This Skill Works

It evaluates current test coverage against configured thresholds from workflow guidelines, selects appropriate test types (unit, integration, e2e), and applies standard patterns for mocks and fixtures. When a decision falls outside its authority, it returns an escalation with rationale.

When to Use It

  • When establishing or adjusting coverage thresholds within 70–90% ranges
  • When deciding between unit, integration, or end-to-end tests for a module
  • When defining mock strategy or test data fixtures to ensure repeatable tests
  • When determining test file organization (co-located vs centralized) per project norms
  • When a proposed decision would bypass critical tests or reduce minimum coverage, requiring user escalation

Quick Start

  1. Step 1: Review the quality question and determine decision category
  2. Step 2: Check current coverage vs thresholds and applicable guardrails
  3. Step 3: Decide with rationale or escalate to the user if outside authority

Best Practices

  • Document rationale for any threshold changes and reference the minimums
  • Follow existing test organization patterns (co-located or centralized)
  • Use consistent mock strategies and fixture naming across modules
  • Prefer targeted unit/integration tests before adopting broad e2e tests
  • Escalate promptly if a change risks missing critical paths

Example Use Cases

  • Increase overall coverage from 70% to 75% for a new API route within the allowed thresholds
  • Switch from unit-only tests to include integration tests for a database module
  • Reduce snapshot testing for a highly dynamic UI, with documented rationale
  • Raise API Routes coverage from 80% to 85% to meet 85% target
  • Escalate when a proposed bypass of quality gates could introduce risk
