What is the core goal of the prompt-engineering workflow?

To systematically optimize prompts for better LLM performance, reliability, and validation.

Which models does it apply to?

Claude, GPT, Gemini and other LLMs; with model-specific considerations for reasoning and non-reasoning architectures.

What techniques are emphasized?

Few-shot examples, XML structure, explicit output formats, prompt chaining, role assignment, long-context handling, and validation with test cases.

prompt-engineering

npx machina-cli add skill OutlineDriven/odin-claude-plugin/prompt-engineering --openclaw

Files (1)

SKILL.md

5.7 KB

Interactive Prompt Optimization Workflow

Execute this workflow to systematically improve any prompt for optimal LLM performance.

Step 1: Analyze Current State

Gather baseline information about your prompt optimization task:

Current prompt: Capture the exact prompt you want to optimize
Target model: Identify the specific model (Claude 4.5, Gemini 3.0, GPT 5.1, etc.)
Use case: Clarify the primary purpose (coding agent, analysis, content generation, conversation)
Failure cases: Document specific examples where current prompt fails or underperforms
Success criteria: Define measurable outcomes (accuracy, format compliance, response time)
Test cases: Create 3-5 representative examples for validation

Step 2: Identify Model Type

Determine the correct prompting approach based on model architecture:

Reasoning Models (Claude 4.x, Gemini 3.0, GPT o-series, DeepSeek-R1):

AVOID explicit CoT phrases like "think step-by-step" or "let's work through this"
PROVIDE rich context with all relevant information upfront
LET the model's internal reasoning handle the thinking process

Non-Reasoning Models (GPT-4o, GPT-4.1, Claude with thinking off):

USE explicit CoT prompting with structured thinking tags
GUIDE the reasoning process with step-by-step instructions
STRUCTURE with <thinking> and <answer> tags

Step 3: Select Core Techniques

Based on your use case, select and apply appropriate techniques:

Essential for ALL prompts:

Few-shot examples: Add 3-5 diverse, representative examples
XML structure: Use tags to separate role, context, examples, task
Output format: Explicitly specify desired response format
Context: Provide all necessary information and constraints

For complex tasks:

Prompt chaining: Break into sequential subtasks with clear handoffs
Role assignment: Define expert persona in system prompt
Long context handling: Place lengthy data first, extract relevant quotes

For coding agents:

Parallel tool calling: Instruct to call independent tools simultaneously
Hallucination prevention: Require reading files before answering
Action bias: Encourage implementation over suggestion
Solution persistence: Emphasize completing tasks end-to-end

For analysis tasks:

Structured thinking: Require step-by-step reasoning
Quote grounding: Extract relevant quotes before analysis
Cross-referencing: Instruct to verify across multiple sources

Step 4: Apply Techniques Systematically

Transform your prompt using selected techniques:

1. Add XML structure:

<role>You are a [domain] expert specializing in [area].</role>

<context>[Relevant background information, constraints, requirements]</context>

<examples
>[3-5 diverse examples showing desired input/output patterns]</examples>

<task>[Specific user request with explicit constraints and output format]</task>

2. Insert representative examples:

Show desired behavior through concrete examples
Use consistent formatting across all examples
Include edge cases if relevant
Demonstrate proper structure and tone

3. Apply model-specific optimization:

Reasoning models: Provide comprehensive context without prescriptive thinking instructions
Non-reasoning models: Add explicit Chain of Thought prompts and thinking tags

4. Specify constraints and format:

Define output structure explicitly
List what to include and what to avoid
Provide length guidelines if applicable
Specify handling of edge cases

Step 5: Test and Validate

Run empirical tests to verify improvements:

1. Test on failure cases:

Run optimized prompt on all documented failure cases
Verify each now produces acceptable output

2. Test edge cases:

Boundary conditions (empty input, maximum length, etc.)
Unusual but valid inputs
Ambiguous scenarios

3. Compare metrics:

Accuracy improvement (before vs after)
Format compliance rate
Response consistency across runs
Execution time (if relevant)

4. Iterate on gaps:

If issues persist: Rephrase instructions for clarity
If format violations: Make format requirements more explicit
If inconsistent: Add more examples showing desired behavior
If slow: Simplify prompt or adjust model parameters

Handle fallback responses:

If model refuses or gives generic responses:
- Increase temperature parameter
- Rephrase request to avoid trigger words
- Check for safety filter activation
- Try different framing of the same task

Step 6: Deliver Optimized Prompt

Package your work for deployment or handoff:

1. Final prompt template:

Complete, production-ready prompt text
Clearly marked sections (role, context, examples, task)
Proper formatting and structure

2. Technique annotations:

Document which techniques were applied and why
Note model-specific considerations
Highlight critical sections

3. Model-specific callouts:

Parameter recommendations (temperature, max_tokens, etc.)
Model family compatibility notes
Performance characteristics

4. Before/after metrics:

Quantitative improvement measurements
Specific failure cases resolved
Remaining limitations or edge cases

5. Usage guidelines:

When to use this prompt
How to adapt for variations
Common pitfalls to avoid

6. Known edge cases:

Scenarios where prompt may still struggle
Recommended fallback approaches
Future improvement opportunities

Source

git clone https://github.com/OutlineDriven/odin-claude-plugin/blob/main/skills/prompt-engineering/SKILL.mdView on GitHub

Overview

An end-to-end workflow for refining prompts to maximize LLM performance. It guides you through analyzing the current prompt, choosing model-specific tactics, and applying structured techniques like XML prompts, few-shot examples, and explicit output formats to validate results.

How This Skill Works

Start by analyzing the prompt state, target model, use case, failure cases, success criteria, and test cases. Then apply core techniques (XML structure, few-shot examples, explicit output format, prompt chaining, and role assignment) and tailor strategies for reasoning versus non-reasoning models, followed by testing and validation.

When to Use It

Optimizing a coding agent prompt for a new language or framework
Improving content generation prompts to meet style and formatting requirements
Adapting prompts for Claude, GPT, Gemini with model-specific techniques
Diagnosing failures and refining success criteria with test cases
Building a reusable XML-structured prompt template for validation

Quick Start

Step 1: Analyze current prompt: capture the exact prompt, target model, use case, failure cases, success criteria, and 3-5 test cases
Step 2: Identify model type (Reasoning vs Non-Reasoning) and select core techniques (few-shot, XML, output format, context)
Step 3: Apply techniques to transform the prompt into an XML structure, insert representative examples, adjust for model-specific optimization, define constraints, and run tests

Best Practices

Include 3-5 diverse few-shot examples to illustrate desired outputs
Use an XML-like structure to separate role, context, examples, and task
Specify explicit output format and constraints, including length and edge cases
Provide complete context upfront with all relevant constraints
Tailor prompts to model type: avoid prescriptive thinking prompts for reasoning models; use thinking tags for non-reasoning models

Example Use Cases

XML-structured prompt guiding a coding task with role and constraints
Few-shot prompt for sentiment analysis with explicit output format
Prompt chaining to break a data extraction task into sequential subtasks
Role-assigned expert persona for a medical QA assistant
Parallel tool-calling for a data pipeline assistant

Frequently Asked Questions

Add this skill to your agents