DDPM → DDIM difference (Denoising Diffusion Probabilistic Models → Denoising Diffusion Implicit Models)
DDPM → DDIM result
Implicit models
DDPM (from Denoising Diffusion Probabilistic Models)
DDIM
chosen mean to ensure $q_\sigma(x_t|x_0) = N(\sqrt\alpha x_0, \sqrt{1-\alpha_t}I)$ for all t (Lemma 1) non-Markovian : depend on $x_{t-1}$ and $x_0$
predict $\epsilon_t$ : denoising generative process calculate $x_0$ given $x_t$ with $x_t = \sqrt{\alpha_t}x_0 + \sqrt{1-\alpha_t}\epsilon$ → $x_0 = f(x_t)$
Training : same with DDPM optimize $\theta$ with forward $q$, reverse $p$
which is $J_{\sigma} = L_{\gamma} + C$ → assume optimal : $J_\sigma = L_\gamma$ (Theorem 1)
Sampling
deterministic when $\sigma_t = 0$ → consistency
cf. non-markovian : sampling from changing $\sigma$ cf. accelerated generation process : sampling trajectory
cf. relevance to neural ordinary differential equation (ODEs)
Result
Reverse process with sub-sequence selection $\tau$ select timesteps such that linear or Quadratic for some c
Closed form equations for each sampling step
Consistency (& accelerated generation)
Interpolation
cf. result