What is a yak shave score?

A 0–100 metric where 0 means laser-focused and 100 indicates maximum drift from the initial goal within a session.

How do I run the analyzer?

Use the slash command /specstory-yak or run the CLI: python scripts/analyze.py with options like --days, --from, --to, --top, --json, etc.

Where are results stored?

Results can be printed to the console, or saved to a file via -o/--output (md or json).

specstory-yak

Scanned

npx machina-cli add skill specstoryai/agent-skills/specstory-yak --openclaw

Files (1)

SKILL.md

7.0 KB

Specstory Yak Shave Analyzer

Analyzes your .specstory/history to detect when coding sessions drifted off track from their original goal. Produces a "yak shave score" for each session.

How It Works

Parses specstory history files from a date range (or all recent sessions)
Extracts the initial user intent from the first message
Tracks domain shifts: file references, tool call patterns, goal changes
Scores each session from 0 (laser focused) to 100 (maximum yak shave)
Summarizes your worst offenders and patterns

What Is Yak Shaving?

"I need to deploy my app, but first I need to fix CI, but first I need to update Node, but first I need to fix my shell config..."

Yak shaving is when you start with Goal A but end up deep in unrelated Task Z. This skill detects that pattern in your AI coding sessions.

Usage

Slash Command

When invoked via /specstory-yak, interpret the user's natural language:

User says	Script args
`/specstory-yak`	`--days 7` (default)
`/specstory-yak last 30 days`	`--days 30`
`/specstory-yak this week`	`--days 7`
`/specstory-yak top 10`	`--top 10`
`/specstory-yak january`	`--from 2026-01-01 --to 2026-01-31`
`/specstory-yak from jan 15 to jan 20`	`--from 2026-01-15 --to 2026-01-20`
`/specstory-yak by modification time`	`--by-mtime`
`/specstory-yak last 14 days as json`	`--days 14 --json`
`/specstory-yak save to yak-report.md`	`-o yak-report.md`
`/specstory-yak last 90 days output to report`	`--days 90 -o report.md`

Direct Script Usage

python /path/to/skills/specstory-yak/scripts/analyze.py [options]

Arguments:

--days N - Analyze last N days (default: 7)
--from DATE - Start date (YYYY-MM-DD)
--to DATE - End date (YYYY-MM-DD)
--path PATH - Path to .specstory/history (auto-detects if not specified)
--top N - Show top N worst yak shaves (default: 5)
--json - Output as JSON
--verbose - Show detailed analysis
--by-mtime - Filter by file modification time instead of filename date
-o, --output FILE - Write report to file (auto-adds .md or .json extension)

Examples:

# Analyze last 7 days
python scripts/analyze.py

# Analyze last 30 days, show top 10
python scripts/analyze.py --days 30 --top 10

# Analyze specific date range
python scripts/analyze.py --from 2026-01-01 --to 2026-01-28

# Filter by when files were modified (not session start time)
python scripts/analyze.py --days 7 --by-mtime

# JSON output for further processing
python scripts/analyze.py --days 14 --json

# Save report to a markdown file
python scripts/analyze.py --days 90 -o yak-report.md

# Save JSON to a file
python scripts/analyze.py --days 30 --json -o yak-data.json

Output

Yak Shave Report (2026-01-21 to 2026-01-28)
==========================================

Sessions analyzed: 23
Average yak shave score: 34/100

Top Yak Shaves:
---------------
1. [87/100] "fix button alignment" (2026-01-25)
   Started: CSS fix for button
   Ended up: Rewriting entire build system
   Domain shifts: 4 (ui -> build -> docker -> k8s)

2. [72/100] "add logout feature" (2026-01-23)
   Started: Add logout button
   Ended up: Refactoring auth system + session management
   Domain shifts: 3 (ui -> auth -> database)

3. [65/100] "update readme" (2026-01-22)
   Started: Documentation update
   Ended up: CI pipeline overhaul
   Domain shifts: 2 (docs -> ci -> testing)

Most Focused Sessions:
----------------------
1. [5/100] "explain auth flow" (2026-01-26) - Pure analysis, no drift
2. [8/100] "fix typo in config" (2026-01-24) - Quick surgical fix

Patterns Detected:
------------------
- You yak shave most on: UI tasks (avg 58/100)
- Safest task type: Code review/explanation (avg 12/100)
- Peak yak shave hours: 11pm-2am (avg 71/100)

Scoring Methodology

The yak shave score (0-100) is computed from:

Factor	Weight	Description
Domain shifts	40%	How many times file references jumped domains
Goal completion	25%	Did the original stated goal get completed?
Session length ratio	20%	Length vs. complexity of original ask
Tool type cascade	15%	Read->Search->Edit->Create->Deploy escalation

Score interpretation:

0-20: Laser focused
21-40: Minor tangents
41-60: Moderate drift
61-80: Significant yak shaving
81-100: Epic rabbit hole

Present Results to User

IMPORTANT: After running the analyzer script, you MUST add a personalized LLM-generated summary at the very top of your response, BEFORE showing the raw report output.

LLM Summary Guidelines

Generate a 3-5 sentence personalized commentary that:

Opens with a verdict - A witty one-liner about the overall state (e.g., "Your coding sessions this week were... an adventure." or "Remarkably disciplined! Someone's been taking their focus vitamins.")
Calls out the highlight - Reference the most notable session specifically:
- If high yak shave: "That January 25th button fix that somehow became a Kubernetes migration? Chef's kiss of scope creep."
- If low yak shave: "Your January 26th auth flow explanation was surgical - in and out, no detours."
Identifies a pattern - Note any recurring theme:
- "You seem to yak shave most when starting with UI tasks"
- "Late night sessions are your danger zone"
- "Your refactoring sessions tend to stay focused"
Ends with actionable advice or a joke - Either:
- A practical tip: "Consider time-boxing those 'quick CSS fixes' - they have a 73% yak shave rate"
- Or a joke: "At this rate, your next typo fix will result in a complete rewrite of the Linux kernel"

Example LLM Summary

## 🐃 Your Yak Shave Analysis

Well, well, well. You came to fix buttons and left having rewritten half the
infrastructure. Your average yak shave score of 47/100 puts you firmly in
"classic developer behavior" territory.

The standout? That January 25th session where a CSS alignment fix somehow
evolved into a full Kubernetes deployment overhaul. Four domain shifts later,
you probably forgot what a button even looks like.

Pattern I noticed: Your UI tasks have a 58% higher yak shave rate than your
code review sessions. Maybe start labeling those "quick UI fixes" as
"potential 3-hour adventures" in your calendar.

Here's the full breakdown:

Then show the raw report output below your summary.

What to Highlight

After your summary, when presenting the raw results:

The worst offenders with before/after comparison
Patterns in when/what causes yak shaving
Actionable insight - what task types to watch out for

Source

git clone https://github.com/specstoryai/agent-skills/blob/main/skills/specstory-yak/SKILL.mdView on GitHub

Overview

Specstory Yak Shave Analyzer analyzes your .specstory/history to identify when coding sessions drift away from the original goal. It assigns a yak shave score from 0 (laser focused) to 100 (maximum drift) per session and surfaces patterns and worst offenders for quick fixes.

How This Skill Works

Parses history files for a date range, extracts the initial intent, and tracks domain shifts such as file references and tool calls. It then scores each session 0–100 and summarizes the worst offenders and patterns.

When to Use It

You want to know if recent SpecStory sessions stayed on goal
You want to audit sprints for productivity and consistency
You need a post-run report to identify recurring distractions
You want to compare current sessions to earlier ones
You prefer automated reports (JSON) for dashboards

Quick Start

Step 1: Run the analyzer over your history (e.g., python scripts/analyze.py --days 14)
Step 2: Open the generated report or JSON file to review yak shave scores and patterns
Step 3: Use the findings to tighten goals and reduce drift in future sessions

Best Practices

Run analysis over a defined date range to contextualize trends
Review top yak shave sessions and map back to initial goals
Filter by top N to focus on biggest distractions
Use --by-mtime or date-based filters to align with your workflow
Combine with JSON output for integration into dashboards

Example Use Cases

Top Yak Shaves: 1) 87/100 'fix button alignment' began as a UI tweak but ended in a rewrite of the build system (domain shifts: ui -> build -> docker -> k8s)
Top Yak Shaves: 2) 72/100 'add logout feature' drifted from UI/auth to authentication/session management
Top Yak Shaves: 3) 65/100 'update readme' started as docs but led to CI pipeline overhaul
Most Focused Session: 1) 5/100 'explain auth flow' remained laser-focused on explaining the authentication flow
An example mid-range session: 34/100 with modest domain hops and actionable lessons

Frequently Asked Questions

Add this skill to your agents