Alicloud Data Lake Dlf
Scanned@cinience
npx machina-cli add skill @cinience/alicloud-data-lake-dlf --openclawCategory: service
Data Lake Formation
Use Alibaba Cloud OpenAPI (RPC) with official SDKs or OpenAPI Explorer to manage resources for Data Lake Formation.
Workflow
- Confirm region, resource identifiers, and desired action.
- Discover API list and required parameters (see references).
- Call API with SDK or OpenAPI Explorer.
- Verify results with describe/list APIs.
AccessKey priority (must follow)
- Environment variables:
ALICLOUD_ACCESS_KEY_ID/ALICLOUD_ACCESS_KEY_SECRET/ALICLOUD_REGION_IDRegion policy:ALICLOUD_REGION_IDis an optional default. If unset, decide the most reasonable region for the task; if unclear, ask the user. - Shared config file:
~/.alibabacloud/credentials
API discovery
- Product code:
DataLake - Default API version:
2020-07-10 - Use OpenAPI metadata endpoints to list APIs and get schemas (see references).
High-frequency operation patterns
- Inventory/list: prefer
List*/Describe*APIs to get current resources. - Change/configure: prefer
Create*/Update*/Modify*/Set*APIs for mutations. - Status/troubleshoot: prefer
Get*/Query*/Describe*StatusAPIs for diagnosis.
Minimal executable quickstart
Use metadata-first discovery before calling business APIs:
python scripts/list_openapi_meta_apis.py
Optional overrides:
python scripts/list_openapi_meta_apis.py --product-code <ProductCode> --version <Version>
The script writes API inventory artifacts under the skill output directory.
Output policy
If you need to save responses or generated artifacts, write them under:
output/alicloud-data-lake-dlf/
References
- Sources:
references/sources.md
Overview
This skill enables programmatic control of Alibaba Cloud Data Lake Formation resources using OpenAPI or official SDKs. It covers listing resources, creating or updating configurations, querying status, and troubleshooting workflows, helping automate governance and operational tasks.
How This Skill Works
Authenticate with environment variables or a shared config, then discover the Data Lake API surface using the DataLake product code (default API version 2020-07-10). Use List/Describe for inventory, Create/Update/Modify/Set for changes, and Get/Query/DescribeStatus for diagnosis, verifying results with subsequent describe/list calls.
When to Use It
- Inventory current Data Lake resources in a region.
- Create or update a Data Lake Formation configuration via API/SDK.
- Check the status of resources, jobs, or configurations.
- Troubleshoot failed workflows using status or describe APIs.
- Discover available APIs and schemas to automate Data Lake tasks.
Quick Start
- Step 1: Confirm region and resource identifiers (region, account, IDs).
- Step 2: Discover APIs with python scripts/list_openapi_meta_apis.py.
- Step 3: Call the API via SDK or OpenAPI Explorer and verify results with describe/list; save outputs under output/alicloud-data-lake-dlf.
Best Practices
- Prefer List*/Describe* calls to safely inventory before mutations.
- Use Create*/Update*/Modify*/Set* for changes to configurations.
- Store credentials in environment variables or ~/.alibabacloud/credentials as described.
- Always verify results with Describe/DescribeStatus after changes.
- Write artifacts and responses under output/alicloud-data-lake-dlf for traceability.
Example Use Cases
- List current Data Lake resources in a region to assess existing setup.
- Create or update a Data Lake configuration to reflect new governance rules.
- Query the status of a Data Lake job or resource to monitor progress.
- Describe status after troubleshooting to confirm resolution.
- Run API discovery script to enumerate available Data Lake API endpoints and schemas.