Get the FREE Ultimate OpenClaw Setup Guide →

capacity

Scanned
npx machina-cli add skill microsoft/GitHub-Copilot-for-Azure/capacity --openclaw
Files (1)
SKILL.md
6.8 KB

Capacity Discovery

Finds available Azure OpenAI model capacity across all accessible regions and projects. Recommends the best deployment location based on capacity requirements.

Quick Reference

PropertyDescription
PurposeFind where you can deploy a model with sufficient capacity
ScopeAll regions and projects the user has access to
OutputRanked table of regions/projects with available capacity
ActionRead-only analysis — does NOT deploy. Hands off to preset or customize
AuthenticationAzure CLI (az login)

When to Use This Skill

  • ✅ User asks "where can I deploy gpt-4o?"
  • ✅ User specifies a capacity target: "find a region with 10K TPM for gpt-4o"
  • ✅ User wants to compare availability: "which regions have gpt-4o available?"
  • ✅ User got a quota error and needs to find an alternative location
  • ✅ User asks "best region and project for deploying model X"

After discovery → hand off to preset or customize for actual deployment.

Scripts

Pre-built scripts handle the complex REST API calls and data processing. Use these instead of constructing commands manually.

ScriptPurposeUsage
scripts/discover_and_rank.ps1Full discovery: capacity + projects + rankingPrimary script for capacity discovery
scripts/discover_and_rank.shSame as above (bash)Primary script for capacity discovery
scripts/query_capacity.ps1Raw capacity query (no project matching)Quick capacity check or version listing
scripts/query_capacity.shSame as above (bash)Quick capacity check or version listing

Workflow

Phase 1: Validate Prerequisites

az account show --query "{Subscription:name, SubscriptionId:id}" --output table

Phase 2: Identify Model and Version

Extract model name from user prompt. If version is unknown, query available versions:

.\scripts\query_capacity.ps1 -ModelName <model-name>
./scripts/query_capacity.sh <model-name>

This lists available versions. Use the latest version unless user specifies otherwise.

Phase 3: Run Discovery

Run the full discovery script with model name, version, and minimum capacity target:

.\scripts\discover_and_rank.ps1 -ModelName <model-name> -ModelVersion <version> -MinCapacity <target>
./scripts/discover_and_rank.sh <model-name> <version> <min-capacity>

💡 The script automatically queries capacity across ALL regions, cross-references with the user's existing projects, and outputs a ranked table sorted by: meets target → project count → available capacity.

Phase 3.5: Validate Subscription Quota

After discovery identifies candidate regions, validate that the user's subscription actually has available quota in each region. Model capacity (from Phase 3) shows what the platform can support, but subscription quota limits what this specific user can deploy.

# For each candidate region from discovery results:
$usageData = az cognitiveservices usage list --location <region> --subscription $SUBSCRIPTION_ID -o json 2>$null | ConvertFrom-Json

# Check quota for each SKU the model supports
# Quota names follow pattern: OpenAI.<SKU>.<model-name>
$usageEntry = $usageData | Where-Object { $_.name.value -eq "OpenAI.<SKU>.<model-name>" }

if ($usageEntry) {
  $quotaAvailable = $usageEntry.limit - $usageEntry.currentValue
} else {
  $quotaAvailable = 0  # No quota allocated
}
# For each candidate region from discovery results:
usage_json=$(az cognitiveservices usage list --location <region> --subscription "$SUBSCRIPTION_ID" -o json 2>/dev/null)

# Extract quota for specific SKU+model
quota_available=$(echo "$usage_json" | jq -r --arg name "OpenAI.<SKU>.<model-name>" \
  '.[] | select(.name.value == $name) | .limit - .currentValue')

Annotate discovery results:

Add a "Quota Available" column to the ranked output from Phase 3:

RegionAvailable CapacityMeets TargetProjectsQuota Available
eastus2120K TPM3✅ 80K
westus390K TPM1❌ 0 (at limit)
swedencentral100K TPM0✅ 100K

Regions/SKUs where quotaAvailable = 0 should be marked with ❌ in the results. If no region has available quota, hand off to the quota skill for increase requests and troubleshooting.

Phase 4: Present Results and Hand Off

After the script outputs the ranked table (now annotated with quota info), present it to the user and ask:

  1. 🚀 Quick deploy to top recommendation with defaults → route to preset
  2. ⚙️ Custom deploy with version/SKU/capacity/RAI selection → route to customize
  3. 📊 Check another model or capacity target → re-run Phase 2
  4. ❌ Cancel

Phase 5: Confirm Project Before Deploying

Before handing off to preset or customize, always confirm the target project with the user. See the Project Selection rules in the parent router.

If the discovery table shows a sample project for the chosen region, suggest it as the default. Otherwise, query projects in that region and let the user pick.

Error Handling

ErrorCauseResolution
"No capacity found"Model not available or all at quotaHand off to quota skill for increase requests and troubleshooting
Script auth erroraz login expiredRe-run az login
Empty version listModel not in region catalogTry a different region: ./scripts/query_capacity.sh <model> "" eastus
"No projects found"No AI Services resourcesGuide to project/create skill or Azure Portal

Related Skills

  • preset — Quick deployment after capacity discovery
  • customize — Custom deployment after capacity discovery
  • quota — For quota viewing, increase requests, and troubleshooting quota errors, defer to this skill instead of duplicating guidance

Source

git clone https://github.com/microsoft/GitHub-Copilot-for-Azure/blob/main/plugin/skills/microsoft-foundry/models/deploy-model/capacity/SKILL.mdView on GitHub

Overview

Capacity Discovery scans all accessible Azure regions and projects to reveal available Azure OpenAI model capacity and quota. It ranks deployment options to help you pick the best region and project for your capacity needs, without performing deployments.

How This Skill Works

The skill uses pre-built scripts (discover_and_rank.ps1/.sh and query_capacity.ps1/.sh) to query capacity across all regions and user projects, then produces a ranked table of options. It runs read-only analysis (no deployment) and can hand off to a preset or customize step for actual deployment after discovery, with authentication via Azure CLI (az login).

When to Use It

  • User asks "where can I deploy gpt-4o?"
  • User specifies a capacity target: "find a region with 10K TPM for gpt-4o"
  • User wants to compare availability: "which regions have gpt-4o available?"
  • User got a quota error and needs to find an alternative location
  • User asks "best region and project for deploying model X"

Quick Start

  1. Step 1: Validate prerequisites: az account show --query "{Subscription:name, SubscriptionId:id}" --output table
  2. Step 2: Identify Model and Version: .\scripts\query_capacity.ps1 -ModelName <model-name> or ./scripts/query_capacity.sh <model-name>
  3. Step 3: Run Discovery: .\scripts\discover_and_rank.ps1 -ModelName <model-name> -ModelVersion <version> -MinCapacity <target> or ./scripts/discover_and_rank.sh <model-name> <version> <min-capacity>

Best Practices

  • Pre-authenticate with Azure CLI (az login) before running capacity discovery.
  • Run the discovery scripts to fetch capacity across all regions and projects you access.
  • Provide a clear MinCapacity target to get a ranked, actionable list.
  • After discovery, verify subscription quota in candidate regions before deployment.
  • Treat the output as read-only and hand off to preset or customize for deployment.

Example Use Cases

  • Find a region with sufficient capacity for gpt-4o and compare multiple regions for TPM availability.
  • Identify the best region and project to deploy a new model version based on current quotas.
  • Work around a quota error by discovering alternative regions with usable capacity.
  • Perform multi-project capacity search to support a cross-tenant deployment plan.
  • Rank regions by capacity to optimize latency and cost without initiating deployment.

Frequently Asked Questions

Add this skill to your agents
Sponsor this space

Reach thousands of developers