fix: disable use_cache during unsloth training to recover v0.8.x VRAM #65
Open
Conversation
The v0.9 rewrite of the response-only SFT path swapped trl.SFTTrainer for plain transformers.Trainer. SFTTrainer silently sets model.config.use_cache = False in its __init__; plain Trainer does not. Left enabled, the KV cache is materialised through every training forward, inflating VRAM significantly on large-vocab / long-context models (Qwen 3.x, etc.) and breaking jobs that fit comfortably on v0.8.2.

This adds apply_training_runtime_fixes(model) right after get_peft_model in the unsloth training entrypoint. It logs use_cache, _attn_implementation, and is_gradient_checkpointing so future runtime regressions are visible in worker logs, and flips use_cache to False when needed.

The weighted_sft job already disables use_cache explicitly, so no change is required there.

Co-authored-by: Cursor <cursoragent@cursor.com>
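For reference, a minimal sketch of what such a helper could look like. This is an assumption about the shape of apply_training_runtime_fixes, not the PR's actual implementation; the attributes it reads (config.use_cache, config._attn_implementation, model.is_gradient_checkpointing) are standard transformers ones named in the description above.

```python
import logging

logger = logging.getLogger(__name__)


def apply_training_runtime_fixes(model):
    """Sketch only -- the real helper in this PR may differ in detail."""
    config = getattr(model, "config", None)

    # Log the runtime knobs that matter for training-time VRAM, so
    # regressions like this one are visible in worker logs.
    logger.info(
        "use_cache=%s _attn_implementation=%s is_gradient_checkpointing=%s",
        getattr(config, "use_cache", None),
        getattr(config, "_attn_implementation", None),
        getattr(model, "is_gradient_checkpointing", None),
    )

    # trl.SFTTrainer disables the KV cache in __init__; plain
    # transformers.Trainer does not, so the cache would be materialised
    # on every training forward. Flip it off for training.
    if config is not None and getattr(config, "use_cache", False):
        config.use_cache = False
        logger.info("use_cache was enabled during training; set to False")
```

Per the description, this would be called right after the PEFT wrap, e.g. model = get_peft_model(model, lora_config) followed by apply_training_runtime_fixes(model).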