Get the FREE Ultimate OpenClaw Setup Guide β†’

dalle

npx machina-cli add skill G1Joshi/Agent-Skills/dalle --openclaw
Files (1)
SKILL.md
983 B

DALL-E 3 / 4

DALL-E is OpenAI's image model. It excels at Prompt Adherenceβ€”it draws exactly what you ask for, including complex text.

When to Use

  • ChatGPT Integration: "Draw a diagram of..." inside a chat flow.
  • Complex Instructions: "A cat holding a sign that says 'Hello World'".
  • Safety: Strict guardrails against NSFW/Copyright.

Core Concepts

Integration

Available via OpenAI API (dall-e-3) and ChatGPT.

Editing

In ChatGPT, you can select an area of the image and say "Make the cat blue".

Best Practices (2025)

Do:

  • Use via ChatGPT: The conversational refinement is its superpower.
  • Ask for JSON: The API can return the image or a revised prompt.

Don't:

  • Don't expect photorealism: It often has a "plastic" or "smooth" look compared to Midjourney.

References

Source

git clone https://github.com/G1Joshi/Agent-Skills/blob/main/skills/ai-ml/dalle/SKILL.mdView on GitHub

Overview

DALL-E 3/4 is OpenAI's image model designed for strong prompt adherence, drawing exactly what you ask, including complex text. It integrates with ChatGPT and the OpenAI API, and supports in-chat editing for quick refinement.

How This Skill Works

Access DALL-E through the OpenAI API (dall-e-3) or ChatGPT. You provide a detailed prompt and, if needed, make edits by selecting a region in the image and issuing commands like 'Make the cat blue'; the API can return either the image or a revised prompt.

When to Use It

  • ChatGPT integration: draw diagrams or visuals inside a chat flow.
  • Complex instructions: render scenes like 'A cat holding a sign that says Hello World'.
  • Safety and compliance: enforce guardrails against NSFW or copyrighted content.
  • Rapid prototyping: generate quick visuals for concept testing via the API.
  • Editing in-chat: refine composition by selecting areas and prompting changes.

Quick Start

  1. Step 1: Generate an image by sending a detailed prompt via the OpenAI API (dall-e-3) or in ChatGPT.
  2. Step 2: If edits are needed, select an area of the image and request changes (e.g., 'Make the cat blue').
  3. Step 3: Retrieve the final image or a revised prompt via JSON from the API or ChatGPT.

Best Practices

  • Use via ChatGPT to leverage conversational refinement.
  • Ask for JSON so the API returns the image or a revised prompt.
  • Don't expect photorealism; DALL-E typically has a plastic or stylized look.
  • Use the in-chat editing feature to adjust color, composition, or objects by area.
  • Respect safety guardrails to avoid NSFW or copyright issues.

Example Use Cases

  • Draw a diagram of a process inside a ChatGPT conversation.
  • Render 'A cat holding a sign that says Hello World'.
  • Prototype product visuals for a concept in minutes.
  • Edit a generated image by selecting an area and requesting a color change.
  • Create safe, copyright-compliant visuals for marketing materials.

Frequently Asked Questions

Add this skill to your agents
Sponsor this space

Reach thousands of developers β†—