Mentatcurated
▸ Concept also: text-to-image, image synthesis, diffusion model

Image generation

A model that produces images from a text prompt or other conditioning signal by learning the statistical structure of large image datasets.

In a nutshell

An image generation model takes a text description — or another image, a sketch, a layout — and produces pixels that match it. Most current systems are trained to reverse a process of adding noise: the model learns how noise degrades real images, then runs that process backwards at inference time. The hard part is coherence at scale: preserving geometry, respecting fine-grained instructions, and keeping outputs consistent across seeds and styles. Quality now rivals studio photography for many subjects; the gap that remains is reliable compositional control.

Where it came from

Year2020
SourceHo et al. — Denoising Diffusion Probabilistic Models (NeurIPS 2020)
Why it matteredEstablished the diffusion framework that underlies most current text-to-image systems.

How this connects

Tap a node to open it