返回时间轴
2021-01-05

OpenAI unveils DALL-E, showing AI can generate images from text descriptions

Capability Breakthrough

事件摘要

On January 5, 2021, OpenAI announced DALL-E, a 12-billion-parameter version of GPT-3 trained to generate images from text captions. It could produce novel compositions — 'a daikon radish in a tutu walking a dog' — that demonstrated combinatorial creativity. While the outputs were low-resolution, DALL-E proved that generative text-to-image was feasible, laying the groundwork for DALL-E 2, Stable Diffusion, and the entire AI image generation revolution that followed.

影响评估

  • Capability Leap +2 · Long-term

    First successful demonstration that a generative language model could produce coherent novel images from text descriptions, proving text-to-image generation was feasible and sparking the generative AI race.

    Affected Groups: AI researchers, computer vision researchers

  • Paradigm Shift +1 · Short-term

    Introduced the public to the concept of AI-generated images from text, preparing the cultural ground for the explosion of generative AI tools that followed in 2022.

    Affected Groups: general public, artists, creative professionals

共识度与来源

重要度 L1
分类 Capability Breakthrough
共识度 Broad Consensus
影响指数 4/10
  • 1

    URL: https://openai.com/index/dall-e/

    We've trained a neural network called DALL-E that creates images from text captions for a wide range of concepts expressible in natural language.
    Reference Evidence Citation logged Live source
  • 2

    URL: https://en.wikipedia.org/wiki/DALL-E

    DALL-E is a text-to-image generation model developed by OpenAI using a 12-billion parameter version of GPT-3.
    Reference Evidence Citation logged Live source