MandateKit — Security

Threat model

MandateKit issues and checks mandates: signed statements of what an AI agent may spend on. The verifier sits in front of a payment authorization, so the adversary's goal is to get a transaction approved that the legitimate issuer did not authorize — by forging a mandate, widening scope, replaying a one-shot mandate, or crashing the verifier.

A valid signature proves integrity (the bytes were not altered) and that the holder of a key signed it. It does not prove the signer is trusted. Authority comes from the integrator pinning which issuer keys they accept.

v0 review (2026-06-05)

An adversarial review found five issues; all are fixed and locked by a security-regression test block in both SDKs (test_attack* / attack N tests).

#	Severity	Issue	Fix
1	Critical	`verify()` trusted the public key embedded in the envelope — anyone could sign their own mandate and be allowed	`verify()` requires `trusted_keys`/`trustedKeys`; fails closed without it (or an explicit `allow_unverified_issuer` opt-in)
2	High	Empty `constraints` → allow-all	Deny unless ≥1 hard scope constraint (`categories`/`merchants`/`max_amount`) is present
3	High	Empty allow-list (`[]`) silently skipped → fail-open	Presence-based checks; an empty allow-list allows nothing
4	High	`max_uses` parsed and stored but never enforced (replayable)	Removed from v0; unknown/unenforceable constraint keys now deny
5	Medium	Deeply nested JSON crashed `verify()` (`RecursionError`)	Iterative depth/size bound (`MAX_DEPTH`/`MAX_NODES`) before canonicalizing
—	Medium	Float amounts diverge in cross-language canonicalization	Amounts must be integers (compared like-for-like, caller-chosen unit); floats rejected at build and verify

Hardening pass (2026-06-05)

Following the review, MandateKit moved onto vetted primitives and added automated and property-based testing:

Constant-time, vetted Ed25519. Signing/verification use the cryptography library by default. The pure-Python RFC 8032 reference remains only as a zero-dependency fallback (not constant-time) and warns at import.
RFC 8785 (JCS) canonicalization via the rfc8785 (Python) and canonicalize (TypeScript) libraries — byte-identical across both SDKs.
Amounts bounded to the JS-safe integer range [0, 2**53-1], so JCS can canonicalize them identically everywhere.
verify() never throws. Property-based tests (Hypothesis / fast-check) fuzz it with arbitrary hostile input; signature checking and every constraint path tolerate malformed types and return a verdict.
CI security gates: CodeQL, Semgrep, and Bandit run on every push, plus a job that exercises the dependency-free fallback path.

Known limitations (by design, v0)

No replay / velocity / usage enforcement. The verifier is stateless. These require the roadmap registry. max_uses is omitted rather than ignored.
Not independently audited. Automated tooling and property tests are not a substitute for a third-party audit.
Intent-basket alignment fails open (a scorer error does not deny) by deliberate choice; integrators wanting fail-closed should enforce it themselves.

Reporting

This is a pre-release v0 prototype. Do not use it to authorize real funds.

Audit status (v0)

This is a v0 release. It has been independently hardened — CodeQL, bandit, semgrep, property-based tests, and adversarial tier 1-2 reviews, all passing in CI — but it has not had a third-party security audit. Treat it accordingly for anything high-stakes.

Security review welcome

We actively want researcher eyes on this. If you find a fail-open, a signature bypass, an SSRF path, or any way to defeat a guarantee in this document, please open an issue. Credit given. The shared crypto core (Ed25519 + RFC 8785 canonicalization) and the verifier's fail-closed paths are the highest-value targets.

v0.0.2 hardening (2026-06-10 internal audit)

Expiry parsing unified. The TypeScript verifier no longer uses the lenient Date.parse (which accepted strings like "30 June 2026" and read naive timestamps as local time). It now uses a strict ISO-8601 parser matching Python's fromisoformat, with naive timestamps anchored to UTC, so the same signed mandate yields the same verdict in both SDKs.
Scope vs amount (clarification). A mandate is "scoped" if it sets any one of categories, merchants, or max_amount. A mandate without max_amount therefore carries no spending ceiling; if you need one, set max_amount explicitly or enforce a cap out of band.
Canonicalization fallback now fails closed on floats / out-of-safe-range integers (see the shared crypto core note).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security

SECURITY.md

MandateKit — Security

Threat model

v0 review (2026-06-05)

Hardening pass (2026-06-05)

Known limitations (by design, v0)

Reporting

Audit status (v0)

Security review welcome

v0.0.2 hardening (2026-06-10 internal audit)

There aren't any published security advisories

Security: major-matters/mandatekit

Security

SECURITY.md

MandateKit — Security

Threat model

v0 review (2026-06-05)

Hardening pass (2026-06-05)

Known limitations (by design, v0)

Reporting

Audit status (v0)

Security review welcome

v0.0.2 hardening (2026-06-10 internal audit)

There aren't any published security advisories