

npx machina-cli add skill akaszubski/autonomous-dev/observability --openclaw

Observability Skill

Comprehensive guide to logging, debugging, profiling, and performance monitoring in Python applications.

When This Skill Activates

  • Adding logging to code
  • Debugging production issues
  • Profiling performance bottlenecks
  • Monitoring application metrics
  • Analyzing stack traces
  • Performance optimization
  • Keywords: "logging", "debug", "profiling", "performance", "monitoring"

Core Concepts

1. Structured Logging

Structured logging emits JSON-formatted, machine-readable log entries that carry rich context.

Why Structured Logging?

  • Machine-parseable (easy to search, filter, aggregate)
  • Context-rich (attach metadata to log entries)
  • Consistent format across services

Key Features:

  • JSON-formatted logs
  • Log levels (DEBUG, INFO, WARNING, ERROR, CRITICAL)
  • Context logging with extra metadata
  • Best practices for meaningful logs

Example:

import logging
import json

logger = logging.getLogger(__name__)
logger.info("User action", extra={
    "user_id": 123,
    "action": "login",
    "ip": "192.168.1.1"
})

See: docs/structured-logging.md for Python logging setup and patterns
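The stdlib `logging` example above attaches `extra` fields, but the default formatter does not render them; producing actual JSON lines takes a custom formatter. A minimal sketch (the class name `JsonFormatter` and the chosen field set are illustrative assumptions, not part of the skill's docs):

```python
import json
import logging

class JsonFormatter(logging.Formatter):
    """Render each log record as one JSON object per line (sketch)."""

    # Standard LogRecord attributes, used to tell `extra` fields apart.
    _RESERVED = set(vars(logging.LogRecord("", 0, "", 0, "", (), None)))

    def format(self, record):
        entry = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Anything passed via extra= lands as a plain attribute on the record.
        for key, value in vars(record).items():
            if key not in self._RESERVED:
                entry[key] = value
        return json.dumps(entry)

handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
logger = logging.getLogger("app")
logger.addHandler(handler)
logger.setLevel(logging.INFO)
logger.info("User action", extra={"user_id": 123, "action": "login"})
```

Each call now emits a single JSON line that log aggregators can parse, filter, and index by the extra fields.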


2. Debugging Techniques

Interactive debugging with pdb/ipdb and effective debugging strategies.

Tools:

  • Print debugging - Quick and simple
  • pdb - Python's built-in debugger
  • ipdb - IPython-enhanced debugger
  • Post-mortem debugging - Debug after crash

pdb Commands:

  • n (next) - Execute current line (step over calls)
  • s (step) - Step into a function call
  • c (continue) - Continue execution
  • p variable - Print variable value
  • l - List source code
  • q - Quit debugger

Example:

import pdb; pdb.set_trace()  # Debugger starts here

See: docs/debugging.md for interactive debugging patterns


3. Profiling

CPU and memory profiling to identify performance bottlenecks.

Tools:

  • cProfile - CPU profiling (built-in)
  • line_profiler - Line-by-line CPU profiling
  • memory_profiler - Memory usage analysis
  • py-spy - Sampling profiler (no code changes)

cProfile Example:

python -m cProfile -s cumulative script.py

Profile Decorator:

import cProfile
import pstats
from functools import wraps

def profile(func):
    @wraps(func)  # preserve the wrapped function's name and docstring
    def wrapper(*args, **kwargs):
        profiler = cProfile.Profile()
        profiler.enable()
        result = func(*args, **kwargs)
        profiler.disable()
        stats = pstats.Stats(profiler)
        stats.sort_stats('cumulative')
        stats.print_stats(10)  # Top 10 functions
        return result
    return wrapper

@profile
def slow_function():
    # Your code here
    pass

See: docs/profiling.md for comprehensive profiling techniques
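memory_profiler (listed above) is a third-party package; for a stdlib-only sketch of the same idea, tracemalloc can diff two snapshots to show which source lines allocated memory. The helper name `build_table` and the sizes below are illustrative:

```python
import tracemalloc

def build_table(n):
    # Deliberately allocates a list of n ten-character strings.
    return [str(i) * 10 for i in range(n)]

tracemalloc.start()
before = tracemalloc.take_snapshot()
table = build_table(50_000)
after = tracemalloc.take_snapshot()
tracemalloc.stop()

# Net allocations per source line between the two snapshots.
stats = after.compare_to(before, "lineno")
for stat in stats[:3]:
    print(stat)
```

The list-comprehension line should dominate the output, which is exactly the signal used to hunt leaks: take snapshots around the suspect code and look at the largest positive diffs.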


4. Monitoring & Metrics

Performance monitoring, timing decorators, and simple metrics.

Timing Patterns:

  • Timing decorator - Measure function execution time
  • Context manager timer - Measure code block duration
  • Performance assertions - Fail if too slow

Simple Metrics:

  • Counters - Track event occurrences
  • Histograms - Track value distributions

Example:

import time
from functools import wraps

def timer(func):
    @wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()  # monotonic clock; preferred over time.time() for intervals
        result = func(*args, **kwargs)
        duration = time.perf_counter() - start
        print(f"{func.__name__} took {duration:.2f}s")
        return result
    return wrapper

@timer
def process_data():
    # Your code here
    pass

See: docs/monitoring-metrics.md for stack traces, timers, and metrics
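The decorator above covers function timing; the other patterns in the lists (context-manager timer, performance assertion, simple counters) can be sketched together. The names `timer_block` and `METRICS` are illustrative, not part of the skill's docs:

```python
import time
from collections import Counter
from contextlib import contextmanager

METRICS = Counter()  # simple event counters, e.g. METRICS["cache_miss"] += 1

@contextmanager
def timer_block(label, budget_s=None):
    """Time a code block; optionally fail if it exceeds a budget."""
    start = time.perf_counter()
    try:
        yield
    finally:
        duration = time.perf_counter() - start
        print(f"{label} took {duration:.4f}s")
        if budget_s is not None:  # performance assertion
            assert duration <= budget_s, f"{label} exceeded {budget_s}s"

with timer_block("sum", budget_s=5.0):
    total = sum(range(1_000_000))
    METRICS["sum_runs"] += 1
```

The context manager measures any block, not just whole functions, and the counter accumulates across runs so a single summary line can be logged later instead of logging inside hot paths.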


5. Best Practices & Anti-Patterns

Debugging strategies and logging anti-patterns to avoid.

Debugging Best Practices:

  1. Binary Search Debugging - Narrow down the problem area
  2. Rubber Duck Debugging - Explain the problem to someone (or something)
  3. Add Assertions - Catch bugs early
  4. Simplify and Isolate - Reproduce with minimal code

Logging Anti-Patterns to Avoid:

  • Logging sensitive data (passwords, tokens)
  • Logging in loops (use counters instead)
  • No context in error logs
  • Inconsistent log formats
  • Too verbose logging (noise)

See: docs/best-practices-antipatterns.md for detailed strategies
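The "logging in loops" anti-pattern and its counter-based fix can be sketched as follows (the item values and logger name are illustrative):

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("batch")

items = ["ok", "bad", "ok", "ok", "bad"]

# Anti-pattern: one log line per item floods the log.
# for item in items:
#     logger.info("processed item %s", item)

# Better: count inside the loop, log one summary line with context.
failed = 0
for item in items:
    if item == "bad":
        failed += 1
logger.info("batch done", extra={"total": len(items), "failed": failed})
```

One summary line with counts preserves the signal (how many failed out of how many) without the per-item noise.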


Quick Reference

| Tool | Use Case | Details |
| --- | --- | --- |
| Structured Logging | Production logs | docs/structured-logging.md |
| pdb/ipdb | Interactive debugging | docs/debugging.md |
| cProfile | CPU profiling | docs/profiling.md |
| line_profiler | Line-by-line profiling | docs/profiling.md |
| memory_profiler | Memory analysis | docs/profiling.md |
| Timer decorator | Function timing | docs/monitoring-metrics.md |
| Context timer | Code block timing | docs/monitoring-metrics.md |

Logging Cheat Sheet

import logging

# Setup
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(name)s - %(levelname)s - %(message)s')
logger = logging.getLogger(__name__)

# Usage
logger.debug("Debug message")       # Detailed diagnostic
logger.info("Info message")         # General information
logger.warning("Warning message")   # Warning (recoverable)
logger.error("Error message")       # Error (handled)
logger.critical("Critical message") # Critical (unrecoverable)

# With context
logger.info("User action", extra={"user_id": 123, "action": "login"})

Debugging Cheat Sheet

# pdb
import pdb; pdb.set_trace()

# ipdb (enhanced)
import ipdb; ipdb.set_trace()

# Post-mortem (debug after crash)
import pdb, sys
try:
    # Your code
    pass
except Exception:
    pdb.post_mortem(sys.exc_info()[2])

Profiling Cheat Sheet

# CPU profiling
python -m cProfile -s cumulative script.py

# Line profiling
kernprof -l -v script.py

# Memory profiling
python -m memory_profiler script.py

# Sampling profiler (no code changes)
py-spy top --pid 12345

Progressive Disclosure

This skill uses progressive disclosure to prevent context bloat:

  • Index (this file): High-level concepts and quick reference (<500 lines)
  • Detailed docs: docs/*.md files with implementation details (loaded on-demand)

Available Documentation:

  • docs/structured-logging.md - Logging setup, levels, JSON format, best practices
  • docs/debugging.md - Print debugging, pdb/ipdb, post-mortem debugging
  • docs/profiling.md - cProfile, line_profiler, memory_profiler, py-spy
  • docs/monitoring-metrics.md - Stack traces, timing patterns, simple metrics
  • docs/best-practices-antipatterns.md - Debugging strategies and logging anti-patterns

Cross-References

Related Skills:

  • error-handling-patterns - Error handling best practices
  • python-standards - Python coding conventions
  • testing-guide - Testing and debugging strategies
  • performance-optimization - Performance tuning techniques

Related Tools:

  • Python logging - Standard library logging module
  • pdb/ipdb - Interactive debuggers
  • cProfile - CPU profiling
  • memory_profiler - Memory analysis
  • py-spy - Sampling profiler

Key Takeaways

  1. Use structured logging - JSON format for machine-readable logs
  2. Log at appropriate levels - DEBUG < INFO < WARNING < ERROR < CRITICAL
  3. Include context - Add metadata to logs (user_id, request_id, etc.)
  4. Don't log sensitive data - Passwords, tokens, PII
  5. Use pdb/ipdb for debugging - Interactive debugging is powerful
  6. Profile before optimizing - Measure to find real bottlenecks
  7. Use cProfile for CPU profiling - Identify slow functions
  8. Use line_profiler for line-level profiling - Fine-grained analysis
  9. Use memory_profiler for memory leaks - Track memory usage
  10. Time critical sections - Decorator or context manager
  11. Binary search debugging - Narrow down problem area
  12. Simplify and isolate - Reproduce with minimal code

Hard Rules

FORBIDDEN:

  • Logging sensitive data (passwords, tokens, API keys) at any level
  • Using print() for production logging (MUST use structured logging)
  • Swallowing exceptions silently without logging

REQUIRED:

  • All errors MUST be logged with context (what failed, input summary, stack trace)
  • Log levels MUST be used correctly: DEBUG for dev, INFO for operations, WARNING for recoverable issues, ERROR for failures
  • Performance-critical paths MUST have timing instrumentation
  • All external calls MUST log duration and status
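The last two rules can be sketched together: a wrapper that logs duration and status around an external call, on success and failure alike. The `log_call` helper and the fake `fetch_user` function are illustrative assumptions, not part of the skill:

```python
import logging
import time

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("external")

def log_call(name, func, *args, **kwargs):
    """Run an external call, logging duration and status either way."""
    start = time.perf_counter()
    try:
        result = func(*args, **kwargs)
    except Exception:
        duration = time.perf_counter() - start
        # Context on failure: what was called, how long it took, full traceback.
        logger.exception("call failed", extra={"call": name, "duration_s": round(duration, 4)})
        raise  # never swallow the exception silently
    duration = time.perf_counter() - start
    logger.info("call ok", extra={"call": name, "duration_s": round(duration, 4)})
    return result

def fetch_user(user_id):  # stand-in for a real HTTP/database call
    return {"id": user_id}

user = log_call("fetch_user", fetch_user, 42)
```

Re-raising after `logger.exception` keeps the stack trace in the logs while still letting callers handle the failure.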

Source

git clone https://github.com/akaszubski/autonomous-dev.git

View on GitHub: https://github.com/akaszubski/autonomous-dev/blob/master/plugins/autonomous-dev/skills/observability/SKILL.md

Overview

Observability combines structured logging, interactive debugging, and profiling to help you understand and optimize Python apps. It emphasizes machine-readable logs, robust debugging, and performance monitoring to drive faster diagnosis and optimization.

How This Skill Works

Instrument your code to emit JSON logs with rich context, and use pdb/ipdb for interactive debugging or post-mortem analysis. Employ profiling tools like cProfile, line_profiler, and memory_profiler to locate bottlenecks, and add lightweight timing decorators or context managers to collect execution metrics that feed dashboards.

When to Use It

  • Adding structured logs with context
  • Debugging production issues and stack traces
  • Profiling CPU and memory bottlenecks
  • Monitoring performance metrics and timings
  • Diagnosing performance regressions and optimizations

Quick Start

  1. Replace ad-hoc prints with JSON structured logs and attach contextual metadata
  2. Insert pdb/ipdb breakpoints or enable post-mortem debugging for crashes
  3. Add profiling (cProfile/line_profiler/memory_profiler) and simple timing decorators to reveal bottlenecks

Best Practices

  • Use JSON-formatted logs with a stable schema and include metadata (e.g., request_id, user_id)
  • Maintain consistent log levels and redact sensitive data
  • Annotate critical paths with timing decorators or context managers to capture latency
  • Prefer post-mortem debugging for crashes and reserve interactive debugging for live issues
  • Combine profilers (cProfile, line_profiler) with sampling tools (py-spy) for a complete view

Example Use Cases

  • logger.info("User action", extra={"user_id": 123, "action": "login", "ip": "192.168.1.1"})
  • import pdb; pdb.set_trace() # Debugger starts here
  • python -m cProfile -s cumulative script.py
  • def profile(func): ... # Decorator using cProfile and pstats to print a top-10 report
  • def timer(func): ... # Decorator printing function duration (e.g., 'func took X.XXs')
