Skip to content

update checkpointing docs#282

Merged
yaseenisolated merged 3 commits into
masterfrom
checkpoint-docs-2
Jun 17, 2026
Merged

update checkpointing docs#282
yaseenisolated merged 3 commits into
masterfrom
checkpoint-docs-2

Conversation

@yaseenisolated

Copy link
Copy Markdown
Contributor

No description provided.

@mintlify

mintlify Bot commented Jun 11, 2026

Copy link
Copy Markdown
Contributor

Preview deployment for your docs. Learn more about Mintlify Previews.

Project Status Preview Updated (UTC)
cerebrium 🟢 Ready View Preview Jun 11, 2026, 9:36 AM

💡 Tip: Enable Workflows to automatically generate PRs for you.


If you are getting an EngineCoreDead exception add `async_scheduling=False` to your AsyncEngineArgs and it should succeed.

The larger the size of the memory checkpoint the slower the restore is. We can reduce the size of the snapshot substantially and improve startup times by dropping the KV Cache before checkpoint and recreating it after restore. vLLM has functionality that does this built in as part of [vLLM Sleep Mode](https://docs.vllm.ai/en/latest/features/sleep_mode/).

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are users able to see the size of the snapshot? Do we log it?

@yaseenisolated yaseenisolated merged commit 5143d69 into master Jun 17, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants