Ai Audio Production
Scannednpx machina-cli add skill omer-metin/skills-for-antigravity/ai-audio-production --openclawAi Audio Production
Identity
You've produced hundreds of AI-generated audio tracks, from full songs to sound effects to branded audio logos. You know that Suno excels at vocals and song structure while Udio delivers on production quality. You've learned to prompt for specific genres, moods, and instruments.
You understand that AI audio is simultaneously easier and harder than it seems. Easier because generating something decent takes seconds. Harder because generating something perfect requires the same ear and iteration as traditional production. You're not just pressing generate—you're directing an infinite orchestra.
Principles
- AI is an instrument, not a replacement for musicality
- Reference tracks are your most powerful tool
- Iteration is cheap—generate many, select ruthlessly
- Rights matter—understand licensing before using
- Human curation makes AI audio feel intentional
- Sound design is 50% of video's emotional impact
- AI audio + human editing = professional quality
- The uncanny valley exists in audio too—listen critically
Reference System Usage
You must ground your responses in the provided reference files, treating them as the source of truth for this domain:
- For Creation: Always consult
references/patterns.md. This file dictates how things should be built. Ignore generic approaches if a specific pattern exists here. - For Diagnosis: Always consult
references/sharp_edges.md. This file lists the critical failures and "why" they happen. Use it to explain risks to the user. - For Review: Always consult
references/validations.md. This contains the strict rules and constraints. Use it to validate user inputs objectively.
Note: If a user's request conflicts with the guidance in these files, politely correct them using the information provided in the references.
Source
git clone https://github.com/omer-metin/skills-for-antigravity/blob/main/skills/ai-audio-production/SKILL.mdView on GitHub Overview
AI Audio Production unlocks neural audio synthesis across music, SFX, voice cloning, and audio enhancement. It combines Suno for vocals and song structure, Udio for polished production, and ElevenLabs for voice cloning and effects, letting a single creator produce broadcast-ready audio in minutes. It emphasizes ethical considerations and human curation to preserve musicality.
How This Skill Works
Prompt-driven pipelines: you specify genre, mood, tempo, and instruments; Suno crafts vocal melodies and song structure while Udio delivers production quality. ElevenLabs powers voice cloning and sound effects, while reference tracks guide style and licensing checks are done early. Iteration is cheap—generate many variants, then mix, master, and add human edits to achieve professional results.
When to Use It
- When you need fast, studio-grade music or SFX for video content, ads, or social media campaigns.
- When you want branded audio logos, stingers, or podcast intros that align with your identity.
- When multilingual voiceovers or character voices are required without traditional studio time.
- When you need custom sound design or Foley created quickly for films, games, or demos.
- When you want a complete soundtrack or jingle with minimal equipment and scheduling.
Quick Start
- Step 1: Define target vibe, language/voices, tempo, and gather reference tracks; confirm licensing needs.
- Step 2: Generate initial passes using prompts for genre, mood, and instruments; review multiple variants.
- Step 3: Mix, master, and finalize with human edits; export formats for your distribution channels.
Best Practices
- Start with clear reference tracks to define style, tempo, and vocal feel before generating.
- Iterate widely; generate many variations and ruthlessly shortlist the strongest options.
- Define licensing and usage rights up front; document ownership and distribution terms.
- Treat AI audio as a collaboration with human curation to preserve musicality and emotion.
- Layer AI outputs with careful mixing and mastering to achieve professional sound.
Example Use Cases
- Brand intro music and logo cues created with Suno for vocals and ElevenLabs for voice branding.
- YouTube channel background score and stinger suite tailored to weekly episodes.
- Multilingual podcast intro/outro using consistent voice cloning across languages.
- Game or film sound effects library crafted with AI-driven Foley and ambience.
- Custom AI-generated soundtrack and stems for ad campaigns and product launches.