What if taxonomy is missing?

If no taxonomy is provided, the system can infer plausible categories from context, but precision improves with an explicit taxonomy.

Can I classify non-text items like images or code files?

Yes. classify works with items such as entities, documents, code, or data, guided by the provided taxonomy.

How is the output structured and what happens if there are multiple good labels?

Output includes labels with per-label confidence and evidence, a primary_label for the top match, an ambiguous flag, a global confidence, and optional probabilities and evidence_anchors. Use multi_label to return multiple strong labels.

classify

npx machina-cli add skill synaptiai/agent-capability-standard/classify --openclaw

Files (1)

SKILL.md

6.2 KB

Intent

Assign one or more labels from a defined taxonomy to items based on their observed characteristics. This capability bridges detection and reasoning by providing semantic categorization.

Success criteria:

Item assigned at least one label from taxonomy
Label assignment supported by evidence
Confidence scores reflect classification certainty
Ambiguous cases explicitly flagged

Compatible schemas:

schemas/output_schema.yaml

Inputs

Parameter	Required	Type	Description
`item`	Yes	any	The item to classify (entity, document, code, data)
`taxonomy`	No	string\|array	Classification scheme or list of valid labels
`multi_label`	No	boolean	Whether multiple labels can be assigned (default: false)
`context`	No	object	Additional context to inform classification

Procedure

Examine the item: Gather characteristics relevant to classification
- Identify distinguishing features
- Note structural patterns, content type, metadata
- Collect evidence for each observed characteristic
Understand the taxonomy: Clarify the classification scheme
- If taxonomy provided, use those labels exclusively
- If no taxonomy, infer appropriate categories from context
- Define clear boundaries between categories
Match characteristics to labels: Evaluate fit for each potential label
- Score how well item characteristics match each category
- Consider edge cases and borderline classifications
- Note which features drive each potential classification
Assign labels: Select the most appropriate label(s)
- For single-label: choose highest confidence match
- For multi-label: include all labels above confidence threshold
- Flag if no label is a strong match
Ground classification: Document evidence supporting each label
- Reference specific characteristics that drove classification
- Note any characteristics that contradict the assignment

Output Contract

Return a structured object:

labels:
  - label: string  # Assigned category/label
    confidence: number  # 0.0-1.0 for this specific label
    evidence: array[string]  # Characteristics supporting this label
probabilities:  # Optional: full probability distribution
  label_1: number
  label_2: number
primary_label: string  # Highest confidence label
ambiguous: boolean  # True if classification is uncertain
confidence: 0..1  # Overall classification confidence
evidence_anchors: ["file:line", "characteristic:value"]
assumptions: []

Field Definitions

Field	Type	Description
`labels`	array	Assigned labels with confidence and evidence
`probabilities`	object	Probability for each possible label
`primary_label`	string	Single best classification
`ambiguous`	boolean	Whether classification is uncertain
`confidence`	number	0.0-1.0 overall confidence

Examples

Example 1: Classify Code File

Input:

item: "src/services/payment_processor.rb"
taxonomy: ["service", "model", "controller", "helper", "concern"]

Output:

labels:
  - label: "service"
    confidence: 0.92
    evidence:
      - "Located in services/ directory"
      - "Class name ends with 'Processor'"
      - "Contains business logic methods without persistence"
probabilities:
  service: 0.92
  model: 0.03
  controller: 0.02
  helper: 0.02
  concern: 0.01
primary_label: "service"
ambiguous: false
confidence: 0.92
evidence_anchors:
  - "src/services/payment_processor.rb:1"
  - "src/services/payment_processor.rb:15-45"
assumptions:
  - "Directory structure follows Rails conventions"

Example 2: Classify with Ambiguity

Input:

item: "User reported issue: 'App crashes on login'"
taxonomy: ["bug", "feature-request", "question", "documentation"]
multi_label: true

Output:

labels:
  - label: "bug"
    confidence: 0.75
    evidence:
      - "Reports crash behavior"
      - "Describes unexpected failure"
  - label: "question"
    confidence: 0.40
    evidence:
      - "Lacks reproduction steps"
      - "May be user error or configuration"
probabilities:
  bug: 0.75
  question: 0.40
  feature-request: 0.05
  documentation: 0.10
primary_label: "bug"
ambiguous: true
confidence: 0.65
evidence_anchors:
  - "issue:title"
  - "issue:body"
assumptions:
  - "Crash is not expected behavior"
  - "User has attempted normal login flow"

Verification

At least one label assigned with confidence > 0.3
Evidence exists for each assigned label
Labels are from specified taxonomy (if provided)
Ambiguous flag set when confidence < 0.7
Probabilities sum to ~1.0 (if provided)

Verification tools: Read (to verify evidence references)

Safety Constraints

mutation: false
requires_checkpoint: false
requires_approval: false
risk: low

Capability-specific rules:

Do not invent labels outside the provided taxonomy
Flag uncertainty rather than forcing low-confidence classifications
Do not access data beyond what's needed for classification
Note when item characteristics are insufficient for reliable classification

Composition Patterns

Commonly follows:

detect - Classify items after detecting their presence
observe - Classify based on observed characteristics
retrieve - Classify retrieved items

Commonly precedes:

compare - Classification enables comparison within categories
plan - Classified items inform planning decisions
generate - Classification guides content generation

Anti-patterns:

Never use classify for binary detection (use detect)
Avoid classify when precise measurement needed (use measure)

Workflow references:

See reference/workflow_catalog.yaml#capability_gap_analysis for classification usage

Source

git clone https://github.com/synaptiai/agent-capability-standard/blob/main/skills/classify/SKILL.mdView on GitHub

Overview

Classify assigns one or more labels from a defined taxonomy to an item based on observed characteristics. It bridges detection and reasoning by providing semantic categorization with traceable evidence. Labels come with confidence scores and can flag ambiguous classifications.

How This Skill Works

The process examines the item to extract distinguishing features, then matches those features to the provided taxonomy. It assigns the most appropriate label(s), recording a confidence score for each label and supporting evidence. If multi_label is enabled, it can return multiple labels that exceed a confidence threshold, along with ground anchors and evidence.

When to Use It

Categorizing entities (people, objects, or concepts) in a dataset
Tagging content (articles, images, code) with a defined taxonomy
Identifying types in documents (reports, memos, emails)
Organizing data for search, retrieval, or routing based on labels
Labeling data for supervised learning or dataset curation

Quick Start

Step 1: Provide the item and a taxonomy (optionally set multi_label).
Step 2: Run classify to generate labels with confidence and evidence.
Step 3: Review primary_label, ambiguity, and evidence; export results.

Best Practices

Provide a clear, well-defined taxonomy before classification
Enable multi_label only when items can legitimately belong to several categories
Ground each label with evidence anchors citing the driving features
Set a sensible confidence threshold and review low-confidence cases
Iterate taxonomy and rules as new edge cases emerge

Example Use Cases

Classify a customer support ticket as billing, technical, or account-related based on content
Tag a news article with topics such as politics, economy, or sports
Label a source code file as service, model, controller, or helper based on location and naming patterns
Annotate product reviews with sentiment labels (positive, negative, neutral) and urgency when applicable
Organize research documents by domain (biology, physics, computer science) and data type (paper, dataset, code)

Frequently Asked Questions

Add this skill to your agents