Skip to content

Default gradient_clipping to 1.0#8068

Open
sfc-gh-truwase wants to merge 1 commit into
masterfrom
grad-clip-default-1.0
Open

Default gradient_clipping to 1.0#8068
sfc-gh-truwase wants to merge 1 commit into
masterfrom
grad-clip-default-1.0

Conversation

@sfc-gh-truwase

Copy link
Copy Markdown
Collaborator

Summary

  • Change GRADIENT_CLIPPING_DEFAULT from 0. (disabled) to 1.0.

Motivation

With the old default, configs that omit gradient_clipping run unclipped. Most RL/LLM training (and the FSDP2 reference) clip at 1.0; this avoids silently-unclipped runs. Isolated into its own PR since it is a default behavior change.

Test plan

  • Init without gradient_clipping -> effective clip norm is 1.0.
  • Explicit gradient_clipping: 0.0 still disables clipping (override respected).

Made with Cursor

Change GRADIENT_CLIPPING_DEFAULT from 0. (disabled) to 1.0 so configs that omit
the key clip at 1.0 by default, matching common RL/LLM training and the FSDP2
reference. Explicit "gradient_clipping": 0.0 still disables clipping.

Signed-off-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant