OpenAI's image model focused on prompt adherence, available via the OpenAI API (dall-e-3) and ChatGPT.

How do I edit generated images?

In ChatGPT, you can select an area of the image and request changes (color, object, or composition) using simple prompts.

Can I get the image or a revised prompt via JSON?

Yes. The API can return the final image directly or provide a revised prompt for further iteration, and you can request JSON responses.

dalle

npx machina-cli add skill G1Joshi/Agent-Skills/dalle --openclaw

Files (1)

SKILL.md

983 B

DALL-E 3 / 4

DALL-E is OpenAI's image model. It excels at Prompt Adherence—it draws exactly what you ask for, including complex text.

When to Use

ChatGPT Integration: "Draw a diagram of..." inside a chat flow.
Complex Instructions: "A cat holding a sign that says 'Hello World'".
Safety: Strict guardrails against NSFW/Copyright.

Core Concepts

Integration

Available via OpenAI API (dall-e-3) and ChatGPT.

Editing

In ChatGPT, you can select an area of the image and say "Make the cat blue".

Best Practices (2025)

Do:

Use via ChatGPT: The conversational refinement is its superpower.
Ask for JSON: The API can return the image or a revised prompt.

Don't:

Don't expect photorealism: It often has a "plastic" or "smooth" look compared to Midjourney.

References

OpenAI DALL-E

Source

git clone https://github.com/G1Joshi/Agent-Skills/blob/main/skills/ai-ml/dalle/SKILL.mdView on GitHub

Overview

DALL-E 3/4 is OpenAI's image model designed for strong prompt adherence, drawing exactly what you ask, including complex text. It integrates with ChatGPT and the OpenAI API, and supports in-chat editing for quick refinement.

How This Skill Works

Access DALL-E through the OpenAI API (dall-e-3) or ChatGPT. You provide a detailed prompt and, if needed, make edits by selecting a region in the image and issuing commands like 'Make the cat blue'; the API can return either the image or a revised prompt.

When to Use It

ChatGPT integration: draw diagrams or visuals inside a chat flow.
Complex instructions: render scenes like 'A cat holding a sign that says Hello World'.
Safety and compliance: enforce guardrails against NSFW or copyrighted content.
Rapid prototyping: generate quick visuals for concept testing via the API.
Editing in-chat: refine composition by selecting areas and prompting changes.

Quick Start

Step 1: Generate an image by sending a detailed prompt via the OpenAI API (dall-e-3) or in ChatGPT.
Step 2: If edits are needed, select an area of the image and request changes (e.g., 'Make the cat blue').
Step 3: Retrieve the final image or a revised prompt via JSON from the API or ChatGPT.

Best Practices

Use via ChatGPT to leverage conversational refinement.
Ask for JSON so the API returns the image or a revised prompt.
Don't expect photorealism; DALL-E typically has a plastic or stylized look.
Use the in-chat editing feature to adjust color, composition, or objects by area.
Respect safety guardrails to avoid NSFW or copyright issues.

Example Use Cases

Draw a diagram of a process inside a ChatGPT conversation.
Render 'A cat holding a sign that says Hello World'.
Prototype product visuals for a concept in minutes.
Edit a generated image by selecting an area and requesting a color change.
Create safe, copyright-compliant visuals for marketing materials.

Frequently Asked Questions

Add this skill to your agents