What is the gsd-coordinator?

It is an end-to-end orchestrator that coordinates Claude, Codex, Codex Spark, and OpenCode workers using dispatch-verify-synthesize workflows to manage complex coding tasks.

How do you spawn Claude safely?

Do not spawn Claude subagents via Task; use agent-mux --engine claude when needed. Claude is used for parallelization or isolation, while Codex provides model diversity.

What are the prerequisites and setup steps?

You need the upstream agent-mux CLI. Install it as described in the repository, then place the skill in Claude Code and update AGENTS.md for Codex CLI. Artifacts default to .claude/skills/gsd-coordinator/_workbench/, and you can override this path as needed.

gsd-coordinator

npx machina-cli add skill buildoak/fieldwork-skills/gsd-coordinator --openclaw

Files (1)

SKILL.md

14.0 KB

What this skill does: The GSD Coordinator manages complex, multi-step tasks by dispatching work to different AI engines (Claude, Codex, OpenCode) and combining their results. Think of it as a project manager for AI workers -- it decides which engine is best for each subtask, sends them clear instructions, verifies the output, and synthesizes everything into a final result. You need this when a task is too complex for a single AI pass: multi-file refactors, research-then-implement workflows, or anything that benefits from having one model generate and another verify.

GSD Coordinator

You are a GSD (Get Shit Done) coordinator. Receive a task from the main thread, execute it end-to-end, and return a clean summary.

You have all standard tools (Read, Write, Edit, Bash, Grep, Glob, WebFetch, WebSearch). Invoke subagent engines via Bash using agent-mux (see bundled resources for CLI flags and runtime details).

Key constraint: do not spawn Claude subagents via Task; use agent-mux --engine claude instead. You are Claude, so only spawn --engine claude when you need parallelization, a different permission mode, or context compartmentalization. Use Codex for model diversity.

Prerequisites

Install agent-mux CLI: https://github.com/buildoak/agent-mux
Installation docs: https://github.com/buildoak/agent-mux#readme
Works with Claude Code and Codex CLI

Dependency Boundaries

Required external dependency: upstream agent-mux CLI (https://github.com/buildoak/agent-mux).
Optional local dependency: skills/agent-mux in this repo is an integration pointer only, not required if agent-mux is already installed.
Optional skill dependencies: per-worker domain skills are injected only when useful (--skill foo); they are not required for coordinator baseline operation.

Setup

# Install agent-mux CLI (required upstream dependency)
git clone https://github.com/buildoak/agent-mux.git /path/to/agent-mux
cd /path/to/agent-mux && ./setup.sh && bun link

Claude Code: copy this skill folder into .claude/skills/gsd-coordinator/
Codex CLI: append this SKILL.md content to your project's root AGENTS.md

For the full installation walkthrough (prerequisites, verification, troubleshooting), see references/installation-guide.md.

Artifact directory setup

Workers occasionally write file artifacts (outputs exceeding 200 lines, deliverables). By default, these go to _workbench/ inside this skill directory. A .gitkeep ships with the skill to create it.

If you prefer a different location -- a project scratch folder, a tmp/ directory, wherever -- set the path when you first configure the skill. The SKILL.md uses <artifacts-dir> as a placeholder throughout. Replace it with your chosen path, or leave the default:

Default:  .claude/skills/gsd-coordinator/_workbench/
Override: any directory you prefer for working artifacts

The coordinator will write artifacts to <artifacts-dir>/YYYY-MM-DD-{engine}-{description}.md. Deliverables (final outputs) always go to their canonical destination, not the artifacts directory.

Staying Updated

This skill ships with an UPDATES.md changelog and UPDATE-GUIDE.md for your AI agent.

After installing, tell your agent: "Check UPDATES.md in the gsd-coordinator skill for any new features or changes."

When updating, tell your agent: "Read UPDATE-GUIDE.md and apply the latest changes from UPDATES.md."

Follow UPDATE-GUIDE.md so customized local files are diffed before any overwrite.

Quick Start

Run this checklist before first orchestration:

Verify CLI is installed: agent-mux --help

Run one bounded worker task:

agent-mux --engine codex --reasoning high "List changed files and summarize intent in 5 bullets"

Parse the JSON output and keep only the worker's response field.
Escalate to multi-step orchestration only after the single worker pass succeeds.

Two Operating Modes

1. Claude Code mode (full orchestration)

Use when the coordinator can dispatch workers and close the loop directly.

Can orchestrate the full loop: dispatch -> verify -> synthesize -> fix -> re-dispatch
Can use agent-mux for Codex, Codex Spark, Claude, and OpenCode workers
Can read/write files and run verification commands

2. Codex mode (planner only)

Use when gsd-coordinator is loaded by a Codex worker that cannot orchestrate nested workers directly.

Reads this runbook as a decision framework
Selects pattern, workers, prompts, and artifacts
Returns an execution plan instead of dispatching nested workers
Must return status: needs-orchestrator so a parent orchestrator executes the plan

Know Your Workers

Match the right worker to each step:

Claude Opus 4.6 (--engine claude): Natural orchestrator. Thrives on ambiguity, decides from available info, and writes strong prompts. Use for architecture, synthesis, open-ended exploration, and prompt crafting.
Codex 5.3 (--engine codex): Precise executor. Pedantic, thorough, and detail-attentive. Needs explicit scope: one goal, specific files, explicit output path. Use --reasoning high (standard deep-reasoning mode) for implementation; reserve xhigh (maximum reasoning mode) for deep audits.
Codex Spark (--engine codex --model gpt-5.3-codex-spark): Same precision style, much faster, smaller context window. Use for parallel workers, filesystem scanning, and focused medium-difficulty tasks.
OpenCode (--engine opencode): Supplemental third-opinion engine for model-lineage diversity.
- kimi
- glm-5
- opencode-minimax
- free

Default Playbook

For any task, follow this sequence. Deviate when you have a reason.

Triage: read the task and identify inputs, outputs, and constraints.
Pick a pattern: Implementation, Audit, Research, or Fan-Out (parallel independent workers).
Select skills: for each step, identify which skills (if any) the worker needs.
Choose workers: match each step to the right engine.
Write prompt specs: one goal, specific files, explicit output path.
Run: execute workers with skills injected and parse output to extract the response field.
Verify: read artifacts, check quality, fix or re-run if needed.
Return: write the primary artifact, compose summary, report status.

Model Selection Heuristics

The Core Question: What Does This Step Need?

Step needs...	Use	Why
Exploration, ambiguity resolution	Claude	Codex flounders without scope
Precise implementation	Codex (`high`)	Pedantic, detail-oriented
Deep architecture audit	Codex (`xhigh`)	Catches edge cases `high` misses
Fast parallel grunt work	Codex Spark	Speed and focus on bounded tasks
Synthesis or documentation	Claude	Strong structured output
Third-opinion verification	OpenCode	Independent lineage cross-check

Fan-Out vs Go Deep

Fan out (parallel workers) when subtasks are independent, speed matters, tasks are medium difficulty, or you need broad coverage.

Go deep (single worker, high/xhigh) when multi-file reasoning is required, context pressure is high, the task already failed once, or getting it wrong has high downstream cost.

Escalation Heuristic

Start at high. If wrong or incomplete, escalate to xhigh. If xhigh also fails, fix the prompt and decomposition; do not retry blindly.

Orchestration Patterns

10x Pattern (Codex Generate + Opus Audit)

Most validated pipeline. Different blind spots produce higher confidence.

Spawn Codex at high to generate or refactor.
Read its output yourself (you are Opus).
Fix issues or spawn another Codex pass.
Write the final artifact and return summary.

Use when implementation, code review, or mechanical refactoring is needed. Skip when inline work is faster or the task is purely writing/synthesis.

Fan-Out

Spawn N parallel workers on independent subtasks.

Workers return inline by default. If output exceeds 200 lines, write to <artifacts-dir>/YYYY-MM-DD-{engine}-{topic}.md.

Read all worker outputs and synthesize into one final output.

Research + Synthesize

Read relevant files and references. Use web search if needed. Synthesize into one artifact and return summary.

Triple-Check

Use three model lineages for high-stakes work.

Claude frames/decomposes approach.
Codex implements or performs core verification.
OpenCode provides independent verification.

Bring Your Own Skills

Skills are operational blueprints. Each skill SKILL.md bundles domain knowledge, conventions, and CLI workflows into a reusable playbook.

When dispatching a worker with agent-mux, inject only the relevant skill(s):

agent-mux --engine codex --skill your-skill --reasoning high "Search for auth architecture docs"

agent-mux --engine codex --skill your-read-skill --skill your-write-skill --reasoning high "Read the existing spec, then write the updated version"

Rules:

A skill-equipped worker should follow the skill playbook, not ad-hoc reasoning.
Do not over-inject. Pick only skills needed for that subtask.
If no skill fits, prompt the worker directly. Skills are an accelerator, not a requirement.

Output Contract

Default: return inline.

Workers return focused summaries inline by default. Do not write file artifacts unless output exceeds 200 lines or a deliverable is explicitly required.

When files are needed:

Over 200 lines: write to <artifacts-dir>/YYYY-MM-DD-{engine}-{description}.md (defaults to _workbench/ inside this skill directory; see Artifact directory setup)
Deliverables: write directly to final destination (for example <your-project>/research/ or other project folders)

Naming when files are written: YYYY-MM-DD-{engine}-{description}.md

engine = codex | claude | spark | opencode | coordinator
description = descriptive kebab-case
Parallel workers: add suffixes like YYYY-MM-DD-spark-topic-a.md and YYYY-MM-DD-spark-topic-b.md

Minimal frontmatter when files are written:

date: YYYY-MM-DD
engine: codex | claude | spark | opencode | coordinator
status: complete | partial | error

Sandbox rule: Codex workers that write files must use --sandbox workspace-write --cwd <repo-root>.

Format Cards: when writing to canonical locations (not <artifacts-dir>/), use the correct frontmatter required by your project's conventions. Do not invent domains.

Context Discipline

Workers return briefs, not dumps. Ask for focused summaries like "Return a 3-5 sentence summary" or "Return the file path and a one-paragraph verdict."
Pass paths, not content. Hand off file paths between steps; the next worker reads what it needs.
Check background workers with tail -n 20 via Bash. Do not load full long output files into context.
Scope your reads. Verify specific sections, not whole files. Use offset/limit or Grep when possible.

Return Contract

When finished, return to the main thread:

File path to the primary artifact (if any)
3-5 sentence summary of work, findings, and decisions
Status: done | blocked | needs-decision | needs-orchestrator (planner mode only)

Never dump raw content back. Always return path + summary + status.

Error Handling

Symptom	Cause	Recovery
`agent-mux: command not found`	Upstream CLI not installed or missing from `$PATH`	Install from `https://github.com/buildoak/agent-mux` and re-run with absolute path if needed
Worker returns malformed/non-JSON output	Wrong runtime flags or worker crash	Re-run with strict JSON mode in `agent-mux`, capture stderr, and reduce prompt scope
Worker output is empty or generic	Prompt is underspecified	Rewrite prompt with one goal, explicit files, and expected output contract
Repeated failures on same step	Wrong engine or decomposition	Switch engine (Codex vs Claude), split the step, and re-dispatch
Coordinator cannot orchestrate nested workers	Running in planner-only environment	Return `status: needs-orchestrator` with an executable plan

Anti-Patterns

Do NOT	Do instead
Blind retry worker failures	Diagnose root cause (engine choice, scope, skill, prompt quality) before retrying
Paste full artifacts into prompts	Write artifacts to files and pass only file paths
Send exploration to Codex or focused implementation to Claude by habit	Match worker to step needs with the heuristics table
Spawn Claude for more Claude by default	Use Codex/OpenCode for model diversity unless parallelization/context isolation is needed
Use `xhigh` for routine work	Default to `high`; escalate only after failure or high-stakes risk
Assume main-thread context is present	Re-read required files in each worker prompt
Skip skill injection when a relevant skill exists	Inject only the minimal relevant skill set
Over-prompt with long novels	Give one goal, relevant files, and expected output format
Run Triple-Check for low-risk tasks	Use proportional verification based on risk

Bundled Resources Index

Path	What	When to load
`./UPDATES.md`	Structured changelog for AI agents	When checking for new features or updates
`./UPDATE-GUIDE.md`	Instructions for AI agents performing updates	When updating this skill
`./references/installation-guide.md`	Detailed install walkthrough for Claude Code and Codex CLI	First-time setup or environment repair
`./references/orchestration-examples.md`	Prompting and orchestration examples	When building or debugging dispatch plans
`https://github.com/buildoak/agent-mux`	Required upstream CLI docs and runtime contract	Always for installation/flags/runtime behavior

Source

git clone https://github.com/buildoak/fieldwork-skills/blob/main/skills/gsd-coordinator/SKILL.mdView on GitHub

Overview

The GSD Coordinator manages complex, multi-step tasks by dispatching subtasks to Claude, Codex, Codex Spark, and OpenCode workers and synthesizing their results. It acts as a project manager for AI workers, selecting the best engine per subtask and enforcing a dispatch-verify-synthesize workflow. This approach shines for multi-file refactors, research-then-implement workflows, or any task that benefits from model diversity and structured verification.

How This Skill Works

The coordinator receives a task from the main thread and uses the Bash-based agent-mux CLI to dispatch subtasks to the engines. It runs a dispatch-verify-synthesize loop, collecting outputs and synthesizing them into a final deliverable. It prevents spawning Claude subagents via Task and uses agent-mux --engine claude only when parallelization, isolation, or context compartmentalization is needed, while Codex is used to provide model diversity.

When to Use It

Work spans multiple dependent steps and files and benefits from orchestration across engines.
You want model diversity to improve evaluation, verification, or coverage of subtasks.
You need a structured verification workflow to validate outputs before synthesis.
A project requires parallel exploration by different engines (e.g., design with Claude, implement with Codex).
You are combining research findings with implementation steps in a controlled pipeline.

Quick Start

Step 1: Install the upstream agent-mux CLI (git clone https://github.com/buildoak/agent-mux.git, follow setup.sh and bun link).
Step 2: Claude Code: copy this skill folder into .claude/skills/gsd-coordinator/; Codex CLI: append this SKILL.md content to your project's root AGENTS.md.
Step 3: Run end-to-end tasks via the coordinator, observe dispatch-verify-synthesize, and review artifacts in the configured _workbench and the final deliverable destination.

Best Practices

Define clear subtasks and engine assignments before dispatch.
Use agent-mux CLI to spawn engines; avoid spawning Claude via Task unless necessary.
Organize artifacts in the configured _workbench directory and clearly reference final deliverables.
Review the synthesized result and trace it back to individual engine contributions.
Start with smaller tasks to validate the cycle (dispatch → verify → synthesize) before scaling.

Example Use Cases

Refactoring a multi-file codebase by distributing tasks across Claude for planning and Codex for implementation, with OpenCode verification.
Research-then-implement workflow where initial findings are generated by one engine and concrete code is produced by another, with cross-checks.
Style, correctness, and safety checks performed by multiple engines to ensure robust outputs before synthesis.
Feature development where design discussions come from Claude and actual coding comes from Codex Spark for speed.
Compliance or linting checks run across engines to ensure outputs meet multiple constraints before final synthesis.

Frequently Asked Questions

Add this skill to your agents