What is ensemble-solving?

A technique to generate three diverse solutions in parallel and pick the best using weighted evaluation.

When should I avoid ensemble-solving?

For simple lookups, single-cause bug fixes, file operations, deterministic changes, or tasks with one obvious solution.

How are results evaluated?

Using a weighted rubric (Correctness 30%, Completeness 20%, Quality 20%, Clarity 15%, Elegance 15%) with task-type adjustments.

ensemble-solving

npx machina-cli add skill mhattingpete/claude-skills-marketplace/ensemble-solving --openclaw

Files (1)

SKILL.md

4.9 KB

Ensemble Problem Solving

Generate multiple solutions in parallel by spawning 3 subagents with different approaches, then evaluate and select the best result.

When to Use

Activation phrases:

"Give me options for..."
"What's the best way to..."
"Explore different approaches..."
"I want to see alternatives..."
"Compare approaches for..."
"Which approach should I use..."

Good candidates:

Architecture decisions with trade-offs
Code generation with multiple valid implementations
API design with different philosophies
Naming, branding, documentation style
Refactoring strategies
Algorithm selection

Skip ensemble for:

Simple lookups or syntax questions
Single-cause bug fixes
File operations, git commands
Deterministic configuration changes
Tasks with one obvious solution

What It Does

Analyzes the task to determine if ensemble approach is valuable
Generates 3 distinct prompts using appropriate diversification strategy
Spawns 3 parallel subagents to develop solutions independently
Evaluates all solutions using weighted criteria
Returns the best solution with explanation and alternatives summary

Approach

Step 1: Classify Task Type

Determine which category fits:

Code Generation: Functions, classes, APIs, algorithms
Architecture/Design: System design, data models, patterns
Creative: Writing, naming, documentation

Step 2: Invoke Ensemble Orchestrator

Task tool with:
- subagent_type: 'ensemble-orchestrator'
- description: 'Generate and evaluate 3 parallel solutions'
- prompt: [User's original task with full context]

The orchestrator handles:

Prompt diversification
Parallel execution
Solution evaluation
Winner selection

Step 3: Present Result

The orchestrator returns:

The winning solution (in full)
Evaluation scores for all 3 approaches
Why the winner was selected
When alternatives might be preferred

Diversification Strategies

For Code (Constraint Variation):

Approach	Focus
Simplicity	Minimal code, maximum readability
Performance	Efficient, optimized
Extensibility	Clean abstractions, easy to extend

For Architecture (Approach Variation):

Approach	Focus
Top-down	Requirements → Interfaces → Implementation
Bottom-up	Primitives → Composition → Structure
Lateral	Analogies from other domains

For Creative (Persona Variation):

Approach	Focus
Expert	Technical precision, authoritative
Pragmatic	Ship-focused, practical
Innovative	Creative, unconventional

Evaluation Rubric

Criterion	Base Weight	Description
Correctness	30%	Solves the problem correctly
Completeness	20%	Addresses all requirements
Quality	20%	How well-crafted
Clarity	15%	How understandable
Elegance	15%	How simple/beautiful

Weights adjust based on task type.

Example

User: "What's the best way to implement a rate limiter?"

Skill:

Classifies as Code Generation
Invokes ensemble-orchestrator
Three approaches generated:
- Simple: Token bucket with in-memory counter
- Performance: Sliding window with atomic operations
- Extensible: Strategy pattern with pluggable backends
Evaluation selects extensible approach (score 8.4)
Returns full implementation with explanation

Output:

## Selected Solution

[Full rate limiter implementation with strategy pattern]

## Why This Solution Won

The extensible approach scored highest (8.4) because it provides
a clean abstraction that works for both simple use cases and
complex distributed scenarios. The strategy pattern allows
swapping Redis/Memcached backends without code changes.

## Alternatives

- **Simple approach**: Best if you just need basic in-memory
  limiting and will never scale beyond one process.

- **Performance approach**: Best for high-throughput scenarios
  where every microsecond matters.

Success Criteria

3 genuinely different solutions generated
Clear evaluation rationale provided
Winner selected with confidence
Alternatives summarized with use cases
User understands trade-offs

Token Cost

~4x overhead vs single attempt. Worth it for:

High-stakes architecture decisions
Creative work where first attempt rarely optimal
Learning scenarios where seeing alternatives is valuable
Code that will be maintained long-term

Integration

feature-planning: Can ensemble architecture decisions
code-auditor: Can ensemble analysis perspectives
plan-implementer: Executes the winning approach

Source

git clone https://github.com/mhattingpete/claude-skills-marketplace/blob/main/engineering-workflow-plugin/skills/ensemble-solving/SKILL.md

View on GitHub

Overview

Ensemble solving generates three diverse solutions in parallel and selects the best. This approach surfaces trade-offs, explores alternatives, and raises quality for architecture decisions, code generation, and creative tasks.

How This Skill Works

It analyzes the task to determine if an ensemble is valuable, creates three distinct prompts using diversification strategies, and spawns three parallel subagents to develop solutions. All results are evaluated with weighted criteria and the winner is returned along with explanations and an alternatives summary.

When to Use It

Architecture decisions with trade-offs (e.g., choosing a data model or deployment pattern).
Code generation with multiple valid implementations (e.g., function or API designs).
API design with different philosophies (REST vs GraphQL vs RPC).
Naming, branding, or documentation style decisions.
Refactoring strategies or algorithm selection (balancing constraints and trade-offs).

Quick Start

Step 1: Classify the task type (Code, Architecture, Creative) and set diversification goals.
Step 2: Invoke the ensemble orchestrator to generate and evaluate 3 parallel solutions.
Step 3: Review the winner and alternatives, then implement with a documented rationale.

Best Practices

Define evaluation criteria and weights up front (correctness, completeness, quality, clarity, elegance).
Choose diversification strategies aligned with the task type (code, architecture, or creative).
Limit prompts and time to keep iterations productive.
Document the rationale for the winner and summarize the alternatives.
Verify the winner against real constraints and tests before implementation.

Example Use Cases

Evaluating three architectural patterns (monolith, microservices, modular monolith) for a new system.
Generating three code implementations of a critical function and selecting the cleanest one.
Designing API endpoints with different philosophies (REST, GraphQL, or RPC).
Exploring naming and branding options for a new feature.
Outlining three refactoring strategies for a legacy module and choosing the most sustainable path.

Frequently Asked Questions

Add this skill to your agents