Proposal
I'd like to propose a cookbook entry covering safe autonomous agent operations — the patterns needed to run Claude-powered agents in production without things going sideways.
Why this matters
The cookbook has great examples for individual API calls, tool use, and RAG — but nothing on how to safely run agents that operate autonomously over extended periods with real external tools (sending emails, managing CRM, posting to LinkedIn, processing invoices).
The community is hitting these gaps hard. On the Claude Code repo alone, there are multiple open issues about agents self-approving actions, bypassing permission boundaries, and losing safety context during compaction (#36690, #32029, #32646, #37157).
What the cookbook entry would cover
Based on our experience running 30+ autonomous agents in production since early 2025, we'd propose covering:
- Graduated autonomy — How to classify agent actions by impact level and enforce appropriate approval gates
-
- Plan > Act > Verify loops — Structuring agent execution so every action is auditable and reversible
-
- Scoped credentials — Never hardcoding secrets; declaring capability sets per agent
-
- Prompt injection defense — Scanning external data before processing
-
- Trust gradients — Expanding agent autonomy as reliability is demonstrated
-
- Surviving context loss — Maintaining safety state across compaction and session handoffs
Existing material
We've already open-sourced everything needed:
- SOSA methodology spec: SOSA.md
-
-
-
- Security guide: SECURITY.md
Happy to contribute a cookbook entry (Jupyter notebook or markdown) with runnable examples. Would this be a welcome addition?
Proposal
I'd like to propose a cookbook entry covering safe autonomous agent operations — the patterns needed to run Claude-powered agents in production without things going sideways.
Why this matters
The cookbook has great examples for individual API calls, tool use, and RAG — but nothing on how to safely run agents that operate autonomously over extended periods with real external tools (sending emails, managing CRM, posting to LinkedIn, processing invoices).
The community is hitting these gaps hard. On the Claude Code repo alone, there are multiple open issues about agents self-approving actions, bypassing permission boundaries, and losing safety context during compaction (#36690, #32029, #32646, #37157).
What the cookbook entry would cover
Based on our experience running 30+ autonomous agents in production since early 2025, we'd propose covering:
Existing material
We've already open-sourced everything needed:
Happy to contribute a cookbook entry (Jupyter notebook or markdown) with runnable examples. Would this be a welcome addition?