doob-h-transform

Here is 1 public repository matching this topic...

sanjitdp / reward-guidance

Experiment code for 'Are we really tilting? The mechanics of reward guidance in flow and diffusion models' — plug-in Doob h-transform sampling, reward damping, best-of-n, and flow map reward guidance for Gaussian mixtures, a 2D checkerboard, and FLUX.1 text-to-image generation.

flux text-to-image generative-models best-of-n diffusion-models flow-matching stochastic-interpolants reward-hacking reward-guidance doob-h-transform

Updated May 7, 2026
Python
