Introduction

On 15 May 2023 a new paper has been published that proposes fundamental changes to the schedulers involved in (stable) diffusion.

Update: The changes proposed in the paper have been merged into Hugginface diffusers (Pull Request #3664)[huggingface/diffusers#3664].

Common Diffusion Noise Schedules and Sample Steps are Flawed

We discover that common diffusion noise schedules do not enforce the last timestep to have zero signal-to-noise ratio (SNR), and some implementations of diffusion samplers do not start from the last timestep. Such designs are flawed and do not reflect the fact that the model is given pure Gaussian noise at inference, creating a discrepancy between training and inference. We show that the flawed design causes real problems in existing implementations. In Stable Diffusion, it severely limits the model to only generate images with medium brightness and prevents it from generating very bright and dark samples. We propose a few simple fixes: (1) rescale the noise schedule to enforce zero terminal SNR; (2) train the model with v prediction; (3) change the sampler to always start from the last timestep; (4) rescale classifier-free guidance to prevent over-exposure. These simple changes ensure the diffusion process is congruent between training and inference and allow the model to generate samples more faithful to the original data distribution.

By implementing the changes from the paper, one can generate much brighter / darker images, which was previously not possible.

In this notebook I try to implement the changes from the paper in actual Python code to try them out.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
common_diffusion_noise_schedulers_are_flawed.ipynb		common_diffusion_noise_schedulers_are_flawed.ipynb
comparison.png		comparison.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Common Diffusion Noise Schedules and Sample Steps are Flawed

About

Releases

Packages

Languages

Max-We/sf-zero-signal-to-noise

Folders and files

Latest commit

History

Repository files navigation

Introduction

Common Diffusion Noise Schedules and Sample Steps are Flawed

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages