dalle
npx machina-cli add skill G1Joshi/Agent-Skills/dalle --openclawDALL-E 3 / 4
DALL-E is OpenAI's image model. It excels at Prompt Adherenceβit draws exactly what you ask for, including complex text.
When to Use
- ChatGPT Integration: "Draw a diagram of..." inside a chat flow.
- Complex Instructions: "A cat holding a sign that says 'Hello World'".
- Safety: Strict guardrails against NSFW/Copyright.
Core Concepts
Integration
Available via OpenAI API (dall-e-3) and ChatGPT.
Editing
In ChatGPT, you can select an area of the image and say "Make the cat blue".
Best Practices (2025)
Do:
- Use via ChatGPT: The conversational refinement is its superpower.
- Ask for JSON: The API can return the image or a revised prompt.
Don't:
- Don't expect photorealism: It often has a "plastic" or "smooth" look compared to Midjourney.
References
Source
git clone https://github.com/G1Joshi/Agent-Skills/blob/main/skills/ai-ml/dalle/SKILL.mdView on GitHub Overview
DALL-E 3/4 is OpenAI's image model designed for strong prompt adherence, drawing exactly what you ask, including complex text. It integrates with ChatGPT and the OpenAI API, and supports in-chat editing for quick refinement.
How This Skill Works
Access DALL-E through the OpenAI API (dall-e-3) or ChatGPT. You provide a detailed prompt and, if needed, make edits by selecting a region in the image and issuing commands like 'Make the cat blue'; the API can return either the image or a revised prompt.
When to Use It
- ChatGPT integration: draw diagrams or visuals inside a chat flow.
- Complex instructions: render scenes like 'A cat holding a sign that says Hello World'.
- Safety and compliance: enforce guardrails against NSFW or copyrighted content.
- Rapid prototyping: generate quick visuals for concept testing via the API.
- Editing in-chat: refine composition by selecting areas and prompting changes.
Quick Start
- Step 1: Generate an image by sending a detailed prompt via the OpenAI API (dall-e-3) or in ChatGPT.
- Step 2: If edits are needed, select an area of the image and request changes (e.g., 'Make the cat blue').
- Step 3: Retrieve the final image or a revised prompt via JSON from the API or ChatGPT.
Best Practices
- Use via ChatGPT to leverage conversational refinement.
- Ask for JSON so the API returns the image or a revised prompt.
- Don't expect photorealism; DALL-E typically has a plastic or stylized look.
- Use the in-chat editing feature to adjust color, composition, or objects by area.
- Respect safety guardrails to avoid NSFW or copyright issues.
Example Use Cases
- Draw a diagram of a process inside a ChatGPT conversation.
- Render 'A cat holding a sign that says Hello World'.
- Prototype product visuals for a concept in minutes.
- Edit a generated image by selecting an area and requesting a color change.
- Create safe, copyright-compliant visuals for marketing materials.