Every architecture surveyed — U-Net, DiT, MMDiT, FLUX — is a different box for computing this single object: the score function, the gradient of the log data density. Forward diffusion destroys information by following a noise schedule; reverse diffusion reconstructs by following the score back toward data. [1] [2]
Every training objective — ε-prediction, x₀-prediction, v-prediction, rectified flow — is a reparametrisation of the same score.
[EDM]
When you dial Midjourney's --stylize up, you are amplifying the conditional score gap relative to the unconditional one. The craft is craft; the substrate is differential geometry.
guidance_scale and --stylize parameter.
w amplifies the conditional-vs-unconditional gap, steering the denoising trajectory in latent space.
[EDM, Karras 2022]