Latent diffusion model

Latent Diffusion Model
Original author	CompVis
Initial release	December 20, 2021
Written in	Python
Type	Generative model; diffusion model
License	MIT
Repository	github.com/CompVis/latent-diffusion

The Latent Diffusion Model (LDM) is a diffusion model architecture developed by the CompVis (Computer Vision & Learning) group at LMU Munich.

Introduced in 2015, diffusion models (DMs) are trained to undo successive applications of noise (commonly Gaussian) to training images. The LDM improves on the standard DM in two ways: it performs the diffusion process in a lower-dimensional latent space rather than directly on pixels, and it allows conditioning through self-attention and cross-attention.
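
The noising step that such a model learns to reverse can be illustrated with a minimal sketch. It is not taken from the CompVis code: `toy_encode` is a hypothetical stand-in for the learned encoder (it only averages neighbouring pixels), the linear noise schedule and its endpoints are assumed for illustration, and all function names are invented here. The sketch compresses a tiny "image" into a latent, adds Gaussian noise at a chosen step, and shows that knowing the exact noise recovers the clean latent, which is why a network trained to predict that noise can be used for denoising.

```python
import math
import random

def linear_betas(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances, one per step (assumed schedule)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]

def alpha_bars(betas):
    """Cumulative products of (1 - beta_t); these shrink toward 0 as t grows."""
    out, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        out.append(prod)
    return out

def toy_encode(image, factor=2):
    """Hypothetical encoder: average each run of `factor` pixels into one latent value."""
    return [sum(image[i:i + factor]) / factor for i in range(0, len(image), factor)]

def add_noise(z0, t, abar, rng):
    """Sample z_t = sqrt(abar_t) * z0 + sqrt(1 - abar_t) * eps, with eps ~ N(0, 1)."""
    eps = [rng.gauss(0.0, 1.0) for _ in z0]
    a, s = math.sqrt(abar[t]), math.sqrt(1.0 - abar[t])
    return [a * z + s * e for z, e in zip(z0, eps)], eps

def remove_noise(zt, eps, t, abar):
    """Invert add_noise given the noise; a trained model would supply a prediction of eps."""
    a, s = math.sqrt(abar[t]), math.sqrt(1.0 - abar[t])
    return [(z - s * e) / a for z, e in zip(zt, eps)]

rng = random.Random(0)
image = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0, 0.5, 0.5]   # 8 "pixels"
z0 = toy_encode(image)                              # 4 latent values
abar = alpha_bars(linear_betas(1000))
zt, eps = add_noise(z0, 500, abar, rng)             # noisy latent at step 500
z0_rec = remove_noise(zt, eps, 500, abar)           # matches z0 up to rounding
```

In this sketch the diffusion operates on four latent values instead of eight pixels; working on a compressed representation in this way is what reduces the cost of diffusion in the latent-space approach.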

The LDM architecture is widely used in practice. For instance, Stable Diffusion versions 1.1 to 2.1 were based on it.