[Managed Agents] Cookbook: Output verification before agent actions

The new Managed Agents architecture decouples brain, hands, and session — but there's no cookbook example showing how to verify agent output before acting on it.

`CMA_gate_human_in_the_loop.ipynb` shows human approval gates. A natural complement is an **automated verification gate** — checking that the agent's claims are factually accurate before executing downstream actions.

**Proposed notebook: `CMA_verify_output_before_action.ipynb`**

Pattern:
1. Agent produces output (research, analysis, recommendation)
2. Verification "hand" checks claims against evidence
3. Session log records the verification receipt alongside the action
4. Agent proceeds only if trust threshold is met

This fits the `execute(name, input) → string` tool interface described in the [Scaling Managed Agents](https://www.anthropic.com/engineering/managed-agents) post — verification is just another stateless hand.

We have a working implementation using VeroQ Shield as the verification tool:

```python
from agents import Agent
from veroq_agentmesh import veroq_output_guardrail

agent = Agent(
    name="Analyst",
    output_guardrails=[veroq_output_guardrail],
)
```

Trips `OutputGuardrailTripwireTriggered` if claims are contradicted or confidence drops below threshold. Each verification produces a receipt with per-claim verdicts and evidence chains — natural session log events.

Happy to contribute the notebook if there's interest.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Managed Agents] Cookbook: Output verification before agent actions #518

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[Managed Agents] Cookbook: Output verification before agent actions #518

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions