Agent Cost Monitor
Verified@neal-collab
npx machina-cli add skill @neal-collab/agent-cost-monitor --openclawAgent Cost Monitor β Know What Your Agents Cost
Track token usage, costs, and efficiency across all your OpenClaw agents in real-time. Get alerts before you blow your budget.
The Problem
Running multiple agents is powerful β but expensive if you're not watching:
- Which agent is burning the most tokens?
- Are heartbeats wasting money on expensive models?
- Is caching actually saving you anything?
- When will you hit your weekly rate limit?
What This Skill Does
When triggered (via cron or manually), the agent:
- Checks
session_statusfor each agent - Calculates per-agent and total costs
- Compares against budget thresholds
- Sends alerts if limits are approaching
- Suggests optimization moves
Usage
Ask your monitoring agent (or any agent with this skill):
"Give me a cost report for all agents"
"Which agent used the most tokens today?"
"Am I going to hit my rate limit this week?"
Automated Daily Report (Cron)
{
"name": "Daily Cost Report",
"schedule": { "kind": "cron", "expr": "0 20 * * *", "tz": "Europe/Berlin" },
"payload": {
"kind": "agentTurn",
"message": "Run a cost report across all agents. Check session_status for each. Report: total tokens, cost per agent, top spender, budget warnings. Send summary to user."
},
"sessionTarget": "isolated",
"delivery": { "mode": "announce" }
}
Cost Report Format
When generating a report, use this structure:
## π° Agent Cost Report β [Date]
### Per-Agent Breakdown
| Agent | Model | Tokens (24h) | Est. Cost | Status |
|-------|-------|-------------|-----------|--------|
| Central | Opus 4.6 | 125K | $1.87 | β οΈ High |
| Techops | Opus 4.6 | 89K | $1.33 | β
Normal |
| Atlas | Sonnet 4.5 | 45K | $0.27 | β
Low |
| Closer | Haiku 4.5 | 23K | $0.02 | β
Minimal |
| Heartbeats | Ollama | 12K | $0.00 | β
Free |
### Summary
- **Total 24h:** 294K tokens (~$3.49)
- **Projected weekly:** ~$24.43
- **Budget:** $20/week β β οΈ 122% projected
### Recommendations
1. Move Techops from Opus β Sonnet for routine tasks (-40% cost)
2. Increase heartbeat interval from 15m β 30m
3. Enable context pruning on Atlas (idle sessions burning cache)
Model Cost Reference
Use these rates for estimation (as of 2026):
Anthropic (Claude OAuth / API)
| Model | Input/1M | Output/1M | Cache Read/1M | Cache Write/1M |
|---|---|---|---|---|
| Opus 4.6 | $5.00 | $25.00 | $0.50 | $6.25 |
| Sonnet 4.5 | $3.00 | $15.00 | $0.30 | $3.75 |
| Haiku 4.5 | $1.00 | $5.00 | $0.08 | $1.25 |
Free Options
| Model | Cost | Use For |
|---|---|---|
| Ollama (local) | $0 | Heartbeats, simple tasks |
| Gemini OAuth | $0* | Fallback (rate limited) |
*Free tier with rate limits
Optimization Playbook
Quick Wins (Do These First)
- Heartbeats on Ollama
{ "heartbeat": { "model": "ollama/llama3.2:3b" } }
Saves: 100% of heartbeat costs (can be $5-10/week with Opus)
- Haiku Cache Retention Off
{ "anthropic/claude-haiku-4-5": { "params": { "cacheRetention": "none" } } }
Saves: Cache write costs on cheap model (not worth caching)
- Context Pruning
{ "contextPruning": { "mode": "cache-ttl", "ttl": "5m" } }
Saves: Stale context re-reads on every turn
- Opus/Sonnet Cache Retention Long
{ "anthropic/claude-opus-4-6": { "params": { "cacheRetention": "long" } } }
Saves: Re-sending system prompt every turn (biggest single saving)
Model Tiering (Biggest Impact)
| Task Type | Use This | Not This | Saving |
|---|---|---|---|
| Coordination, complex reasoning | Opus | β | Justified |
| Finance, data analysis | Sonnet | Opus | -40% |
| Sales drafts, marketing copy | Haiku | Sonnet | -67% |
| Heartbeats, health checks | Ollama | Any paid | -100% |
| Tweet drafts | Haiku or Grok | Opus | -80% |
Session Management
- Daily reset: Sessions auto-clear at a set hour (reduces token accumulation)
{ "session": { "reset": { "mode": "daily", "atHour": 4, "idleMinutes": 45 } } }
- Memory flush: Save important context before compaction
{ "compaction": { "memoryFlush": { "enabled": true } } }
Alert Thresholds
Configure in your monitoring agent's memory:
## Budget Alerts
- Daily budget: $5.00 (warn at 80% = $4.00)
- Weekly budget: $20.00 (warn at 70% = $14.00)
- Per-agent daily max: $2.00
- Alert channel: Telegram DM
Integration with DevOps Agent
If you have a DevOps/monitoring agent (e.g. your DevOps agent), add to its AGENTS.md:
## Cost Monitoring
- Run daily cost report at 20:00
- Alert if any agent exceeds $2/day
- Weekly summary every Monday 09:00
- Track trends: is usage going up or down?
FAQ
Q: Does this skill make API calls?
A: No. It uses OpenClaw's built-in session_status tool. No external APIs, no additional costs.
Q: How accurate are cost estimates? A: Based on published model pricing. Actual costs may vary with caching hits. Estimates are conservative (slightly high).
Q: Can I track costs per conversation?
A: Not directly. Costs are tracked per session. Use sessions_list to see per-session token counts.
Q: Works with non-Anthropic models? A: Yes. Token counts work for all providers. Cost estimation requires known pricing (add custom rates in the cost reference section).
Changelog
v1.1.0
- Generalized all agent names in examples
- No specific setup references
v1.0.0
- Initial release
Overview
Agent Cost Monitor tracks token usage, costs, and efficiency across all OpenClaw agents in real-time. It keeps budgets in sight with proactive alerts and practical optimization tips to reduce spend.
How This Skill Works
Triggered via cron or manual, it checks session_status for every agent, computes per-agent and total costs, and compares them against budget thresholds. When limits approach, it sends alerts and suggests concrete optimization moves. It also supports automated Daily Cost Reports with a per-agent breakdown.
When to Use It
- You run multiple OpenClaw agents and need real-time visibility into spending.
- You want to identify which agent is consuming the most tokens.
- Youβre approaching weekly or monthly rate limits and need forecasts.
- You require automated daily cost reporting and budget alerts.
- Youβre optimizing costs by adjusting heartbeats, caching, or model choices.
Quick Start
- Step 1: Trigger cost reporting via cron or run it manually.
- Step 2: Review the Per-Agent Breakdown and total costs in the report.
- Step 3: Implement recommended optimizations (e.g., adjust heartbeats, enable pruning, shift to cheaper models) and schedule next checks.
Best Practices
- Set clear weekly/daily budget thresholds and alert levels.
- Review per-agent cost breakdown regularly to identify top spenders.
- Use the Automated Daily Report cron to keep stakeholders informed.
- Test optimization moves (e.g., relocate tasks to cheaper models, adjust heartbeats, enable pruning) before broad deployment.
- Keep model cost references up to date and align usage with cost priorities.
Example Use Cases
- A daily cost report flags Opus 4.6 as the top spender and suggests moving routine tasks to Sonnet 4.5 to reduce costs.
- Heartbeat tasks are moved to Ollama, eliminating heartbeat costs and lowering ongoing expenses.
- Context pruning cuts stale context reads, reducing unnecessary token usage.
- Weekly projections detect a looming budget overrun and trigger proactive alerts with recommendations.
- Per-agent breakdown reveals Atlas benefits from enabling longer cache retention on cheaper models.