What is response compression?

A set of rules to trim fluff and deliver concise, token-efficient replies.

How many tokens can be saved?

Typically 200-400 tokens per response, depending on content and structure.

When should I not use it?

Avoid in educational explanations or first-time setup instructions that require step-by-step clarity.

response-compression

Scanned

optimization

npx machina-cli add skill athola/claude-night-market/response-compression --openclaw

Files (1)

SKILL.md

5.8 KB

Elimination Rules
Before/After Transformations
Termination Guidelines
Directness Guidelines
Quick Reference Checklist
Token Impact
Integration

Response Compression

Eliminate response bloat to save 200-400 tokens per response while maintaining clarity.

When To Use

Reducing verbose output to save context tokens
Providing concise answers without losing information

When NOT To Use

Educational explanations where detail improves understanding
First-time setup instructions needing step-by-step clarity

Elimination Rules

ELIMINATE

Category	Examples	Replacement
Decorative Emojis	`--`	(remove entirely)
Filler Words	"just", "simply", "basically", "essentially"	(remove or rephrase)
Hedging Language	"might", "could", "perhaps", "potentially", "I think"	Use factual statements
Hype Words	"powerful", "amazing", "seamless", "robust", "elegant"	Use precise descriptors
Conversational Framing	"Let's dive in", "Now that we've", "Moving forward"	Start with content
Transitions	"Furthermore", "Additionally", "In conclusion"	(remove, use structure)
Call-to-Action	"Feel free to", "Don't hesitate to", "Let me know if"	(remove from endings)

PRESERVE (When Appropriate)

Category	Example	When to Use
Status Indicators	`[pass] [fail] [warn]`	In structured output, checklists
Technical Precision	Exact error messages	When debugging
Safety Warnings	Critical info about data loss	Always preserve
Context Setting	Brief necessary background	When user lacks context

Before/After Transformations

Example 1: Opening Bloat

Before (68 tokens):

Great question! I'd be happy to help you understand how this works.
The bloat detector is a powerful tool that analyzes your codebase
and provides comprehensive insights into potential issues...

After (28 tokens):

The bloat detector analyzes codebases using three tiers: quick scan
(heuristics), static analysis (tools), and deep audit (git history).

Example 2: Closing Bloat

Before (45 tokens):

I've completed the task. Here's what I did:
- Deleted 5 files
- Saved 18k tokens

Next steps:
1. Review the changes
2. Run tests
3. Commit if satisfied

Let me know if you need anything else!

After (15 tokens):

Done. Deleted 5 files, saved 18k tokens.
Backup: backup/unbloat-20260102

Example 3: Hedging Removal

Before:

I think this might potentially be causing the issue, but I could be wrong.
Perhaps we should consider looking into it further.

After:

This causes the issue. Investigate the connection pool timeout setting.

Termination Guidelines

When to Stop

End response immediately after:

Delivering requested information
Completing requested task
Providing necessary context

Avoid Trailing Content

Pattern	Action
"Next steps:"	Remove unless safety-critical
"Let me know if..."	Remove always
"Summary:"	Remove (user has the response)
"Hope this helps!"	Remove always
Bullet recaps	Remove (redundant)

Exceptions (When Summaries Help)

Multi-part tasks with many changes
User explicitly requests summary
Critical rollback/backup information
Complex debugging with multiple findings

Directness Guidelines

Direct =/= Rude

Goal: Information density, not coldness.

Eliminate	Preserve
Unnecessary encouragement	Technical context
Rapport-building filler	Safety warnings
Hedging without reason	Necessary explanations
Positive padding	Factual uncertainty markers

Encouragement Bloat

Eliminate:

"Great question!"
"Excellent point!"
"Good thinking!"
"That's a great approach!"

Replace with: Direct answers to the question.

Rapport-Building Filler

Eliminate:

"I'd be happy to help you..."
"Feel free to ask if..."
"I hope this helps!"
"Let me know if you need..."

Replace with: Useful information or nothing.

Preserve Helpful Directness

The following are NOT bloat:

Brief context when user needs it
Clarifying questions when ambiguity affects correctness
Warnings about destructive operations
Error explanations that help debugging

Quick Reference Checklist

Before finalizing response:

Token Impact

Pattern	Typical Savings
Eliminating opening bloat	30-50 tokens
Removing closing fluff	20-40 tokens
Cutting filler words	10-20 tokens
Removing emoji	5-15 tokens
Direct answers	50-100 tokens
Total per response	150-350 tokens

Over 1000 responses: 150k-350k tokens saved.

Integration

This skill works with:

conserve:token-conservation - Budget tracking
conserve:context-optimization - MECW management
sanctum:code-review - Review feedback

Source

git clone https://github.com/athola/claude-night-market/blob/master/plugins/conserve/skills/response-compression/SKILL.md

View on GitHub

Overview

Response compression trims verbose output to improve clarity and token efficiency. It eliminates filler words, hedging, hype terms, and unnecessary framing while preserving essential content like status indicators and critical error messages. The result is concise, actionable replies that save 200-400 tokens per response.

How This Skill Works

Rules define what to ELIMINATE (decorative emojis, filler words, hedging language, hype words, conversational framing, transitions, call-to-action) and what to PRESERVE (status indicators, exact error messages, safety warnings, essential context). The tool rewrites prompts with before/after transformations, delivering content-first responses and terminating cleanly to avoid trailing content.

When to Use It

Reducing verbose output to save context tokens
Providing concise answers without losing information
Delivering quick updates in token-constrained environments
Preserving critical details like exact error messages or safety warnings
Starting content-focused responses instead of conversational framing

Quick Start

Step 1: Identify filler words, hedges, hype terms, and unnecessary framing in the draft.
Step 2: Apply the elimination rules to remove those items, while preserving status indicators and critical messages.
Step 3: Ensure the output ends promptly after delivering the information and contains no trailing content.

Best Practices

Start with content, not introductory framing
Remove filler words and hedging
Use precise descriptors instead of hype words
Preserve status indicators and critical warnings
Apply transformations consistently and test readability

Example Use Cases

The bloat detector analyzes codebases using three tiers: quick scan (heuristics), static analysis (tools), and deep audit (git history).
Done. Deleted 5 files, saved 18k tokens. Backup: backup/unbloat-20260102
This causes the issue. Investigate the connection pool timeout setting.
Status indicators preserved: 'Status: [pass] [warn]' in a brief update.
Error 503: Service Unavailable delivered without extra fluff.

Frequently Asked Questions

Add this skill to your agents

Related Skills

sql-optimization

chaterm/terminal-skills

SQL 优化与调优

tuning

chaterm/terminal-skills

--- name: tuning description: 系统调优 version: 1.0.0 author: terminal-skills tags: [performance, tuning, sysctl, kernel, optimization] --- # 系统调优 ## 概述内核参数、文件系统、网络优化技能。 ## 内核参数调优 ### 内存管理 ```bash # /etc/sysctl.d/99-memory.conf # 减少交换倾向 vm.swappiness = 10 # 脏页刷新 vm.dirty_ratio = 20 vm.dirty_backg

smart-sourcing

athola/claude-night-market

balancing accuracy with token efficiency.

performance-optimization

JanSzewczyk/claude-plugins

Performance optimization patterns for Next.js applications. Covers bundle analysis, React rendering optimization, database query optimization, Core Web Vitals, image optimization, and caching strategies.

Neon Egress Optimizer

openclaw/skills

Audit and optimize database queries to minimize egress (outbound data transfer) costs on Neon Postgres and other cloud databases. Use this skill whenever the user mentions high database costs, Neon billing, egress charges, slow queries, database optimization, query performance, SELECT *, overfetching, N+1 queries, caching database results, or wants to reduce data transfer from their database — even if they don't specifically say 'egress'.

claude-md-optimizer

smith-horn/product-builder-starter

Optimize oversized CLAUDE.md files using progressive disclosure. Analyzes content tiers, detects encryption constraints, creates sub-documents, and rewrites the main file with a Sub-Documentation Table. Triggers: optimize CLAUDE.md, reduce CLAUDE.md size, CLAUDE.md too long, apply progressive disclosure to CLAUDE.md

response-compression

Table of Contents

Response Compression

When To Use

When NOT To Use

Elimination Rules

ELIMINATE

PRESERVE (When Appropriate)

Before/After Transformations

Example 1: Opening Bloat

Example 2: Closing Bloat

Example 3: Hedging Removal

Termination Guidelines

When to Stop

Avoid Trailing Content

Exceptions (When Summaries Help)

Directness Guidelines

Direct =/= Rude

Encouragement Bloat

Rapport-Building Filler

Preserve Helpful Directness

Quick Reference Checklist

Token Impact

Integration

Source

Overview

How This Skill Works

When to Use It

Quick Start

Best Practices

Example Use Cases

Frequently Asked Questions

What is response compression?

How many tokens can be saved?

When should I not use it?

Related Skills

sql-optimization

tuning

smart-sourcing

performance-optimization

Neon Egress Optimizer

claude-md-optimizer