ln-651-query-efficiency-auditor
npx machina-cli add skill levnikolaevich/claude-code-skills/ln-651-query-efficiency-auditor --openclawPaths: File paths (
shared/,references/,../ln-*) are relative to skills repo root. If not found at CWD, locate this SKILL.md directory and go up one level for repo root.
Query Efficiency Auditor (L3 Worker)
Specialized worker auditing database query patterns for redundancy, inefficiency, and misuse.
Purpose & Scope
- Worker in ln-650 coordinator pipeline - invoked by ln-650-persistence-performance-auditor
- Audit query efficiency (Priority: HIGH)
- Check redundant fetches, batch operation misuse, caching scope problems
- Write structured findings to file with severity, location, effort, recommendations
- Calculate compliance score (X/10) for Query Efficiency category
Inputs (from Coordinator)
MANDATORY READ: Load shared/references/task_delegation_pattern.md#audit-coordinator--worker-contract for contextStore structure.
Receives contextStore with: tech_stack, best_practices, db_config (database type, ORM settings), codebase_root, output_dir.
Domain-aware: Supports domain_mode + current_domain.
Workflow
-
Parse context from contextStore
- Extract tech_stack, best_practices, db_config, output_dir
- Determine scan_path (same logic as ln-624)
-
Scan codebase for violations
- All Grep/Glob patterns use
scan_path - Trace call chains for redundant fetches (requires reading caller + callee)
- All Grep/Glob patterns use
-
Collect findings with severity, location, effort, recommendation
-
Calculate score using penalty algorithm
-
Write Report: Build full markdown report in memory per
shared/templates/audit_worker_report_template.md, write to{output_dir}/651-query-efficiency.mdin single Write call -
Return Summary: Return minimal summary to coordinator (see Output Format)
Audit Rules (Priority: HIGH)
1. Redundant Entity Fetch
What: Same entity fetched from DB twice in a call chain
Detection:
- Find function A that calls
repo.get(id)orsession.get(Model, id), then passesid(not object) to function B - Function B also calls
repo.get(id)orsession.get(Model, id)for the same entity - Common pattern:
acquire_next_pending()returns job, but_process_job(job_id)re-fetches it
Detection patterns (Python/SQLAlchemy):
- Grep for
repo.*get_by_id|session\.get\(|session\.query.*filter.*idin service/handler files - Trace: if function receives
entity_id: int/UUIDAND internally doesrepo.get(entity_id), check if caller already has entity object - Check
expire_on_commitsetting: ifFalse, objects remain valid after commit
Severity:
- HIGH: Redundant fetch in API request handler (adds latency per request)
- MEDIUM: Redundant fetch in background job (less critical)
Recommendation: Pass entity object instead of ID, or remove second fetch when expire_on_commit=False
Effort: S (change signature to accept object instead of ID)
2. N-UPDATE/DELETE Loop
What: Loop of individual UPDATE/DELETE operations instead of single batch query
Detection:
- Pattern:
for item in items: await repo.update(item.id, ...)orfor item in items: await repo.delete(item.id) - Pattern:
for item in items: session.execute(update(Model).where(...))
Detection patterns:
- Grep for
for .* in .*:followed byrepo\.(update|delete|reset|save|mark_)within 1-3 lines - Grep for
for .* in .*:followed bysession\.execute\(.*update\(within 1-3 lines
Severity:
- HIGH: Loop over >10 items (N separate round-trips to DB)
- MEDIUM: Loop over <=10 items
Recommendation: Replace with single UPDATE ... WHERE id IN (...) or session.execute(update(Model).where(Model.id.in_(ids)))
Effort: M (rewrite query + test)
3. Unnecessary Resolve
What: Re-resolving a value from DB when it is already available in the caller's scope
Detection:
- Method receives
profile_idand resolves engine from it, but caller already determinedengine - Method receives
lang_codeand looks up dialect_id, but caller already has bothlanganddialect - Pattern: function receives
X_id, doesget(X_id), extracts.field, when caller already hasfield
Severity:
- MEDIUM: Extra DB query per invocation, especially in high-frequency paths
Recommendation: Split method into two variants: with_known_value(value, ...) and resolving_value(id, ...); or pass resolved value directly
Effort: S-M (refactor signature, update callers)
4. Over-Fetching
What: Loading full ORM model when only few fields are needed
Detection:
session.query(Model)orselect(Model)without.options(load_only(...))for models with >10 columns- Especially in list/search endpoints that return many rows
- Pattern: loading full entity but only using 2-3 fields
Severity:
- MEDIUM: Large models (>15 columns) in list endpoints
- LOW: Small models (<10 columns) or single-entity endpoints
Recommendation: Use load_only(), defer(), or raw select(Model.col1, Model.col2) for list queries
Effort: S (add load_only to query)
5. Missing Bulk Operations
What: Sequential INSERT/DELETE/UPDATE instead of bulk operations
Detection:
for item in items: session.add(item)instead ofsession.add_all(items)for item in items: session.delete(item)instead of bulk delete- Pattern: loop with single
INSERTper iteration
Severity:
- MEDIUM: Any sequential add/delete in loop (missed batch optimization)
Recommendation: Use session.add_all(), session.execute(insert(Model).values(list_of_dicts)), bulk_save_objects()
Effort: S (replace loop with bulk call)
6. Wrong Caching Scope
What: Request-scoped cache for data that rarely changes (should be app-scoped)
Detection:
- Service registered as request-scoped (e.g., via FastAPI
Depends()) with internal cache (_cachedict,_loadedflag) - Cache populated by expensive query (JOINs, aggregations) per each request
- Data TTL >> request duration (e.g., engine configurations, language lists, feature flags)
Detection patterns:
- Find classes with
_cache,_loaded,_initializedattributes - Check if class is created per-request (via DI registration scope)
- Compare: data change frequency vs cache lifetime
Severity:
- HIGH: Expensive query (JOINs, subqueries) cached only per-request
- MEDIUM: Simple query cached per-request
Recommendation: Move cache to app-scoped service (singleton), add TTL-based invalidation, or use CacheService with configurable TTL
Effort: M (change DI scope, add TTL logic)
Scoring Algorithm
MANDATORY READ: Load shared/references/audit_scoring.md for unified scoring formula.
Output Format
MANDATORY READ: Load shared/templates/audit_worker_report_template.md for file format.
Write report to {output_dir}/651-query-efficiency.md with category: "Query Efficiency" and checks: redundant_fetch, n_update_delete_loop, unnecessary_resolve, over_fetching, missing_bulk_ops, wrong_caching_scope.
Return summary to coordinator:
Report written: docs/project/.audit/ln-650/{YYYY-MM-DD}/651-query-efficiency.md
Score: X.X/10 | Issues: N (C:N H:N M:N L:N)
Critical Rules
- Do not auto-fix: Report only
- Trace call chains: Rules 1 and 3 require reading both caller and callee
- ORM-aware: Check
expire_on_commit,autoflush, session scope before flagging redundant fetches - Context-aware: Small datasets or infrequent operations may justify simpler code
- Exclude tests: Do not flag test fixtures or setup code
Definition of Done
- contextStore parsed successfully (including output_dir)
- scan_path determined (domain path or codebase root)
- All 6 checks completed:
- redundant fetch, N-UPDATE loop, unnecessary resolve, over-fetching, bulk ops, caching scope
- Findings collected with severity, location, effort, recommendation
- Score calculated using penalty algorithm
- Report written to
{output_dir}/651-query-efficiency.md(atomic single Write call) - Summary returned to coordinator
Reference Files
- Worker report template:
shared/templates/audit_worker_report_template.md - Audit scoring formula:
shared/references/audit_scoring.md - Audit output schema:
shared/references/audit_output_schema.md
Version: 1.0.0 Last Updated: 2026-02-04
Source
git clone https://github.com/levnikolaevich/claude-code-skills/blob/master/ln-651-query-efficiency-auditor/SKILL.mdView on GitHub Overview
ln-651-query-efficiency-auditor scans code for inefficient DB queries, flagging redundant entity fetches, N-UPDATE/DELETE loops, over-fetching, and caching issues. It records findings with severity, location, effort, and recommendations, and computes a compliance score for query efficiency. This reduces latency and database load in critical paths.
How This Skill Works
The worker loads contextStore (tech_stack, best_practices, db_config, codebase_root, output_dir) and determines scan_path. It scans the codebase with Grep/Glob patterns, traces call chains to confirm patterns like duplicate repo.get/session.get and N-item update loops, collects structured findings, and writes a full markdown audit report to the output_dir while computing a numeric score.
When to Use It
- Suspected redundant entity fetch across a call chain
- N-UPDATE/DELETE loop executing per-item DB calls
- Unnecessary resolves or over-fetching before returning data
- Missing bulk operations where a single bulk query would suffice
- Caching scope misalignments causing stale reads or incorrect persistence
Quick Start
- Step 1: Load the contextStore (tech_stack, best_practices, db_config, codebase_root, output_dir) from the coordinator.
- Step 2: Run codebase scan using Grep/Glob patterns aligned with the rules to detect redundant fetches, N-UPDATE/DELETE loops, and bulk gaps.
- Step 3: Build the audit report in memory and write to {output_dir}/651-query-efficiency.md in a single Write call using the audit template.
Best Practices
- Trace call chains for redundant fetches and confirm both caller and callee patterns
- Prefer passing entity objects instead of IDs to downstream code to avoid re-fetches
- Replace per-item DB calls with bulk operations (e.g., IN (...) or bulk UPDATE/DELETE)
- Limit selected fields to only what is needed to reduce over-fetching
- Validate caching behavior and expire_on_commit settings to prevent stale objects after commits
Example Use Cases
- API handler fetches an entity by id and then passes the id again, triggering a second fetch in the callee
- Batch processing loops that call update/delete for each item individually instead of a single bulk operation
- Code resolves relations unnecessarily, causing extra lazy loads or extra queries
- Code selects entire user rows when only IDs or a subset of fields are needed
- expire_on_commit is disabled or misconfigured, leading to stale objects after commit and unintended re-fetches