T
Agentic Devops
Scanned@tkuehnl
npx machina-cli add skill @tkuehnl/agentic-devops --openclawFiles (1)
SKILL.md
2.8 KB
When to use this skill
Use this skill when the user wants to:
- Run system diagnostics or health checks
- Manage Docker containers (status, logs, health, compose)
- Inspect running processes, ports, or resource hogs
- Analyze log files for errors, patterns, or frequency
- Check HTTP endpoint availability or port status
- Get a quick one-command system overview
Commands
Quick Diagnostics (start here)
# Full system health report β CPU, memory, disk, Docker, ports, errors, top processes
python3 skills/agentic-devops/devops.py diag
Docker Operations
# Container status overview
python3 skills/agentic-devops/devops.py docker status
# Tail container logs with pattern filtering
python3 skills/agentic-devops/devops.py docker logs <container> --tail 100 --grep "error|warn"
# Docker health summary (running, stopped, unhealthy)
python3 skills/agentic-devops/devops.py docker health
# Docker Compose service status
python3 skills/agentic-devops/devops.py docker compose-status --file docker-compose.yml
Process Management
# List processes sorted by resource usage
python3 skills/agentic-devops/devops.py proc list --sort cpu
# Show ports in use
python3 skills/agentic-devops/devops.py proc ports
# Detect zombie processes
python3 skills/agentic-devops/devops.py proc zombies
Log Analysis
# Analyze log file for error patterns
python3 skills/agentic-devops/devops.py logs analyze /var/log/syslog --pattern "error|fail|critical"
# Tail log file with highlighted patterns
python3 skills/agentic-devops/devops.py logs tail /var/log/app.log --highlight "ERROR|WARN"
# Frequency analysis of log patterns
python3 skills/agentic-devops/devops.py logs frequency /var/log/app.log --top 20
Health Checks
# Check HTTP endpoint health
python3 skills/agentic-devops/devops.py health check https://myapp.com/healthz
# Scan specific ports
python3 skills/agentic-devops/devops.py health ports 80,443,8080,5432
# System resource health (CPU, memory, disk)
python3 skills/agentic-devops/devops.py health system
Requirements
- Python 3.8+ (stdlib only, no external dependencies)
- Docker CLI (optional β Docker sections degrade gracefully if not installed)
- Standard Unix utilities (ps, ss/netstat)
Overview
A production-grade CLI toolkit for agent-level DevOps tasks, covering Docker management, process monitoring, log analysis, and health checks. Built by engineers who run production, it delivers quick diagnostics and operational visibility.
How This Skill Works
Invoked as a Python CLI (devops.py) with subcommands like diag, docker, proc, logs, and health. It relies on Python 3.8+ stdlib, uses the Docker CLI when available, and parses system data and logs to produce actionable insights; it degrades gracefully when Docker isnβt installed.
When to Use It
- Run system diagnostics or health checks
- Manage Docker containers (status, logs, health, compose)
- Inspect running processes, ports, or resource hogs
- Analyze log files for errors, patterns, or frequency
- Check HTTP endpoint availability or port status
Quick Start
- Step 1: Run a quick diagnostic: python3 skills/agentic-devops/devops.py diag
- Step 2: Check containers: python3 skills/agentic-devops/devops.py docker status
- Step 3: Verify service health: python3 skills/agentic-devops/devops.py health check https://myapp.com/healthz
Best Practices
- Run a full diag before making changes in production
- Verify container health and logs before rolling updates
- Cross-check port usage when diagnosing network issues
- Combine log analysis with frequency patterns for incident triage
- Test commands in a staging environment before prod
Example Use Cases
- Get a full system health report: python3 skills/agentic-devops/devops.py diag
- Check container status: python3 skills/agentic-devops/devops.py docker status
- Tail container logs with error patterns: python3 skills/agentic-devops/devops.py docker logs <container> --tail 100 --grep 'error|warn'
- Analyze syslog for errors and patterns: python3 skills/agentic-devops/devops.py logs analyze /var/log/syslog --pattern 'error|fail|critical'
- Verify HTTP endpoint health: python3 skills/agentic-devops/devops.py health check https://myapp.com/healthz
Frequently Asked Questions
Add this skill to your agents