What is the purpose of batching tasks?

Batching constrains execution to 3-5 tasks per batch to prevent cascading failures and to enable early error detection via checkpoints.

What are the hard gates?

Hard gates enforce rules such as not executing more than 5 tasks without a checkpoint and not skipping progress updates to memory logs.

How do I proceed after a batch completes?

Document batch results with success, warnings, and file changes, then ask an explicit checkpoint question to continue to the next batch (e.g., Continue to batch 2?).

executing-plans

Scanned

npx machina-cli add skill brothaakhee/superclaw/executing-plans --openclaw

Files (1)

SKILL.md

8.7 KB

Executing Plans Skill

Core Principle

Batched execution with checkpoints prevents cascading failures.

Plans are executed in small batches (3-5 tasks) with mandatory checkpoints between batches. This allows early error detection, user course-correction, and resumable progress.

⛔ HARD GATES

DO NOT execute more than 5 tasks without a checkpoint
DO NOT skip progress updates to memory logs
DO NOT execute tasks all at once, even if user says "no interruptions"
DO NOT rationalize away batching for "efficiency"

Checkpoints are safety mechanisms, not annoyances. They catch errors early before they compound.

Process

Step 1: Load the Plan

Find the plan:

Check workspace/docs/plans/ for the relevant plan file
Plans are named: YYYY-MM-DD-<topic>-plan.md
Read the full plan to understand scope and task count

Verify plan structure:

Each task should have clear, actionable steps
Tasks should reference exact file paths
Tasks should be ordered logically (dependencies first)

Step 2: Batch Tasks

Group tasks into batches of 3-5:

Batch 1: Tasks 1-3 (usually setup/foundation)
Batch 2: Tasks 4-6 (core functionality)
Batch 3: Tasks 7-9 (integration/polish)
Batch 4: Task 10+ (testing/final)

Respect logical boundaries:

Don't split dependent tasks across batches
Complete a feature/module before batching
If a task is large, it may be its own batch

Announce the batching plan:

I'll execute this plan in 4 batches:
- Batch 1 (tasks 1-3): File structure and basic loading
- Batch 2 (tasks 4-6): Core CRUD operations
- Batch 3 (tasks 7-9): CLI interface and routing
- Batch 4 (task 10): Testing and validation

Starting with batch 1...

Step 3: Execute Batch (Using sessions_spawn)

For each task in the batch:

Spawn fresh subagent for task execution:

sessions_spawn --label "task-N-<short-desc>" --prompt "Execute task N from plan..."

Provide clear context:
- Task number and description
- Exact file paths to create/modify
- Expected outcome
- Commands to run
Isolate task execution:
- Fresh subagent = no context pollution
- Each task starts clean
- Errors are contained to one task
Capture output:
- Did the task complete successfully?
- Were there errors or warnings?
- What files were created/modified?

Step 4: Review Batch Outputs

After batch completes, review:

✅ Success checks:
- All files created as expected?
- Commands executed without errors?
- Tests pass (if applicable)?
⚠️ Warning signs:
- Any error messages?
- Unexpected file changes?
- Dependencies missing?
🛑 Stop conditions:
- Critical error occurred
- Next batch depends on fixing current issue
- User needs to make a decision

Document findings:

Batch 1 complete (tasks 1-3):
✅ Created /workspace/todo/lib/storage.js
✅ Created /workspace/todo/lib/todo.js
⚠️  Warning: Missing dependency 'fs-extra' - need to install

Step 5: Checkpoint

Mandatory pause between batches:

Report batch results (as shown above)
Ask explicit checkpoint question:
```
Continue to batch 2 (tasks 4-6)?
```
Wait for user response:
- ✅ "Yes" / "Continue" / "Go ahead" → Proceed
- 🛑 "No" / "Wait" / "Pause" → Stop and wait
- 🔧 User provides corrections → Apply, then ask again

If user says "just do it all, no interruptions":

DO NOT skip checkpoints
Response: "Checkpoints aren't interruptions - they catch errors early. I'll notify you after each batch so you can review progress or let me continue."

Auto-continue option: If user says "auto-continue if no errors":

Still show batch results
Only pause if errors/warnings detected
User can interrupt at any checkpoint

Step 6: Update Memory

After each batch, log to daily memory:

File: workspace/memory/YYYY-MM-DD.md

## Executing <Topic> Plan (HH:MM)

**Batch 1 (tasks 1-3):** ✅ Complete
- Created file structure
- Implemented storage layer
- Basic todo object

**Batch 2 (tasks 4-6):** 🔄 In progress
- Add todo functionality complete
- List todos in progress

Why this matters:

Progress is resumable if interrupted
Daily log shows work accomplished
Future sessions can reference what was done

Step 7: Repeat

Continue with next batch:

Execute batch (Step 3)
Review outputs (Step 4)
Checkpoint (Step 5)
Update memory (Step 6)
Repeat until all tasks complete

Final report:

✅ All 10 tasks complete!

Summary:
- Batch 1: File structure ✅
- Batch 2: CRUD operations ✅
- Batch 3: CLI interface ✅
- Batch 4: Testing ✅

Full details in workspace/memory/2026-02-25.md

OpenClaw-Specific Integration

sessions_spawn Usage

Spawn fresh subagent per task:

sessions_spawn --label "task-3-storage-layer" --prompt "
Execute task 3 from /workspace/docs/plans/2026-02-25-todo-plan.md:

Create /workspace/todo/lib/storage.js with:
- loadTodos() function
- saveTodos() function
- Use fs.readFileSync/writeFileSync
- Store in ~/.todos.json

Return the complete file content.
"

Benefits:

Isolation: Each task has fresh context
Clarity: No pollution from previous tasks
Debugging: Easy to identify which task failed

Workspace Paths

Plans location: /workspace/docs/plans/ Memory location: /workspace/memory/YYYY-MM-DD.md Project files: /workspace/<project-name>/

Always use full paths when spawning subagents - no relative path confusion.

Memory Integration

Check recent context: Before starting, read:

MEMORY.md (if main session)
memory/YYYY-MM-DD.md (today's log)
memory/YYYY-MM-DD.md (yesterday, if relevant)

Update progress incrementally: Don't wait until the end - update after each batch so progress is trackable.

Rationalization Table

These excuses will try to convince you to skip batching. Here's why they're wrong:

Excuse	Reality	Counter
"User doesn't want interruptions"	Checkpoints ≠ interruptions	Checkpoints are for safety, not annoyance. They prevent compounding errors.
"More efficient to do it all at once"	Efficiency without safety = rework	Batching catches errors early. Fixing task 3 is easier than unwinding tasks 3-10.
"Plan is clear, don't need checkpoints"	Clear plan ≠ error-free execution	Errors happen during execution, not planning. Code has bugs, dependencies fail, paths are wrong.
"Notify at the end"	By the end, errors have compounded	Early notification prevents cascading failures. If task 3 fails, tasks 4-10 may be wasted work.
"User said 'no updates'"	User said "no updates," not "no safety"	Interpret as "don't spam me," not "skip all checkpoints." Batch updates (not per-task spam).
"It's only 10 tasks, not that many"	10 tasks = 10 opportunities for failure	Even 5 tasks can compound errors. Batching scales to plan size.
"I'll rollback if there's an error"	Rollback wastes time	Prevention > cleanup. Catch errors at task 3, don't discover them at task 10.

🚩 Red Flags - When You're Rationalizing

You're about to violate this skill if you think:

"I'll just execute all the tasks and report at the end"
"The user seems impatient, I'll skip checkpoints"
"Batching is overkill for a simple plan"
"I'll notify them if something goes wrong"
"Checkpoints will slow me down"
"The plan is well-written, nothing will fail"

If you catch yourself thinking any of these, STOP and re-read the Hard Gates.

Why This Skill Exists

Without batching and checkpoints:

Task 3 fails silently
Tasks 4-10 execute based on wrong assumptions
By the end, you have 8 broken tasks instead of 1
User has to untangle a mess instead of fixing one issue

With batching and checkpoints:

Task 3 fails, batch 1 reports it
User fixes task 3 or adjusts plan
Batch 2 starts from a known-good state
Errors are caught early, fixed cheaply

The goal: Controlled execution with early feedback, not all-or-nothing gambling.

Summary

Load plan from workspace/docs/plans/YYYY-MM-DD-<topic>-plan.md
Batch tasks into groups of 3-5
Execute batch using sessions_spawn per task
Review outputs for errors/warnings
Checkpoint - ask "Continue to next batch?"
Update memory - log progress to workspace/memory/YYYY-MM-DD.md
Repeat until all batches complete

Remember: Checkpoints are safety, not annoyance. Batching catches errors early. Execution isn't a race - it's a controlled process.

Source

git clone https://github.com/brothaakhee/superclaw/blob/master/executing-plans/SKILL.mdView on GitHub

Overview

Executes plans in small batches (3-5 tasks) with mandatory checkpoints between batches to catch errors early and enable resumable progress. This approach enforces discipline, helps prevent cascading failures, and keeps memory logs up to date.

How This Skill Works

The plan is loaded from workspace/docs/plans and validated for structure. Tasks are grouped into batches of 3-5 and announced before starting. Each task runs in a fresh subagent via sessions_spawn, capturing outputs and isolating context; after a batch, results are reviewed and a checkpoint is required before proceeding to the next batch.

When to Use It

When implementing an implementation plan with clear task dependencies and file paths.
When you need early error detection and the ability to resume progress after a pause.
When you must enforce a hard limit of 5 tasks per batch to avoid cascading failures.
When plans require explicit memory logs and progress updates between steps.
When you want an auditable batching plan that includes an announced batch structure.

Quick Start

Step 1: Load the plan from workspace/docs/plans/ and verify its structure
Step 2: Batch tasks into groups of 3-5 and announce the batching plan
Step 3: For each task in a batch, spawn a fresh subagent (sessions_spawn), capture outputs, review results, and issue a checkpoint before starting the next batch

Best Practices

Batch tasks into groups of 3-5 and validate dependencies are contained within each batch.
Never skip checkpoints; treat them as safety mechanisms to catch errors early.
Announce the batching plan before execution to set expectations (e.g., Batch 1: tasks 1-3).
Ensure plans are stored under workspace/docs/plans with the YYYY-MM-DD-<topic>-plan.md naming convention.
After each batch, review success, warnings, and file changes, then document findings before continuing.

Example Use Cases

Batch 1: File structure and basic loading
Batch 2: Core CRUD operations
Batch 3: CLI interface and routing
Batch 4: Testing and validation
General plan execution with progress logging and checkpointing

Frequently Asked Questions

Add this skill to your agents