Add scope/refusal contract to chat system prompt (closes #101) by vahid-ahmadi · Pull Request #102 · PolicyEngine/policyengine-uk-chat

vahid-ahmadi · 2026-06-08T08:40:56Z

Summary

Adds a SCOPE & REFUSAL: section near the top of SYSTEM_PROMPT in backend/routes/chatbot.py that makes "out of scope" an explicit contract the model follows. It defines what is in scope (UK tax/benefit microsimulation over the datasets and years capabilities() reports) and out of scope (non-UK policy, macro forecasting, unannounced/future Budgets, legal/tax-filing advice, anything capabilities() reports as not modelled), with clear off-topic, unmodelled, and partial-answer rules.

Rationale

Today the chat has no first-class handling for off-topic or unmodelled questions:

Off-topic questions are answered anyway, paying full input/output cost (system prompt + cached reference.md) for something that should be declined in one sentence.
On-topic-but-unmodelled questions (macro/GDP/inflation, non-UK policy) degrade into re-running run_python and re-guessing API shapes instead of stopping after one capabilities() check and saying "not modelled."

The only prior guardrail was a single buried line. This section replaces that with an explicit in/out-of-scope list plus a stop-after-one-check rule.

A partial-answer rule and a personal-allowance/inflation example guard against false refusals: questions that touch a non-modelled dimension but can still be partially answered are answered with the limitation explained, not declined.

Notes

Prompt-only change — no new tools, no change to _build_system_blocks, no run_python sandbox change. No added LLM call or latency.
This is "Layer 2" of the broader layered plan for off-topic/uncomputable questions, complementing the off-topic pre-flight topic gate (PR Cut wasted tokens (off-topic gate) and cold-start latency (/chat/backends warmup) #95) and graceful failure on the iteration cap (Cap run_python iterations with a best-effort fallback message #84 / PR Cap chat iterations at 12 with graceful fallback (closes #84) #87).

Closes #101

🤖 Generated with Claude Code

vercel · 2026-06-08T08:41:02Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
policyengine-uk-chat	Ready	Preview, Comment	Jun 15, 2026 2:56pm

github-actions · 2026-06-08T08:41:44Z

Beta preview is ready.

Frontend: open preview
Backend: open backend

Add a SCOPE & REFUSAL section near the top of SYSTEM_PROMPT defining what is in scope (UK tax/benefit microsimulation over the datasets and years capabilities() reports) and out of scope (non-UK policy, macro forecasting, unannounced Budgets, legal/tax-filing advice, anything capabilities() reports as not modelled). Off-topic questions are declined in one sentence with no tool calls; on-topic-but-unmodelled questions stop after a single capabilities() check instead of looping or guessing API shapes. A partial-answer rule plus a personal-allowance/inflation example guard against false refusals. Prompt-only change: no new tools, no change to _build_system_blocks, no run_python sandbox change. Closes #101 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

anth-volk · 2026-06-15T14:21:33Z

+- Partial-answer rule: a question that touches a non-modelled dimension but can
+  still be partially answered should be answered with the limitation explained,
+  NOT refused.
+- For example, "how will raising the personal allowance affect inflation?"
+  should be answered by computing the modelled fiscal and distributional impact
+  and clearly noting that second-round macro effects (inflation, behaviour) lie
+  outside the microsimulation — not declined outright.


@vahid-ahmadi You sure about this part?

The decline list flatly listed "inflation" as out-of-scope, but the flagship partial-answer example was an inflation question it said NOT to decline — contradictory guidance for the same query type. Scope the macro decline to pure-forecast asks ("what will inflation/GDP/employment be?") with no modelled lever, make the partial-answer rule explicitly take precedence when a modelled policy is in the question, and reword the example so the answer addresses the modelled part rather than implying it answered the inflation question. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

vahid-ahmadi · 2026-06-15T14:55:53Z

@anth-volk heads-up on a fix I just pushed (1108bc2) to the SCOPE & REFUSAL block, in case it affects your review.

Contradiction in the original text: the decline list named inflation as flatly out-of-scope —

Out of scope (decline): … macroeconomic forecasting (GDP, inflation, employment, market reactions)

— but the flagship partial-answer example was an inflation question it said not to decline. So for the same query type the prompt gave opposite instructions; the model could swing between a curt refusal and a full simulation.

Fix:

Scoped the macro decline to the pure-forecast case — "what will inflation/GDP/employment be?" with no modelled tax-benefit lever in the question — so it no longer collides with the example.
Made the partial-answer rule explicitly take precedence when a modelled policy is in the question, so the tie-break is unambiguous.
Reworded the example so the answer makes clear it's addressing the modelled part (fiscal/distributional) and that the macro part is out of scope — rather than implying it answered the inflation question.

Note this PR also got rebased onto current main earlier (main had refactored SYSTEM_PROMPT out of chatbot.py into section constants in prompts.py), so the contract now lives as a SCOPE_AND_REFUSAL section there. Verified: prompt assembles correctly and test_prompts.py passes.

Still flagging, as before, that prompt-behaviour like this really wants an eval case (topic-gate / #52 harness) to lock it in rather than trusting wording alone — happy to follow up.

anth-volk · 2026-06-15T17:43:56Z

Following up on my review — a design suggestion on the partial-answer rule specifically (now that 1108bc2 has resolved the inflation contradiction).

As written, the rule is compute-first: when a question centres on a modelled reform but also touches a non-modelled dimension, the model runs the simulation immediately and caveats the unmodelled part inline. I'd like us to consider flipping it to confirm-first:

State the boundary up front — what it can and can't answer for this question.
Offer the modelled analysis.
Run the simulation only once the user agrees.

So "how will raising the personal allowance affect inflation?" would first return something like: "I can't model the inflation (second-round macro) effect — that's outside the microsimulation. I can show the fiscal and distributional impact of raising the personal allowance. Want that?" — and compute on confirmation, rather than computing immediately.

Why I think this is worth it:

It avoids spending the expensive path on an unwanted answer. The partial-answer route runs run_economy_simulation, the heaviest tool we have. If the user actually wanted the inflation answer (which we can't give), compute-first burns a full distributional simulation and then tells them the thing they cared about is out of scope. Confirm-first spends nothing on the engine until the user says the modelled slice is useful.
It sets expectations before presenting numbers, which reduces the risk of a partial answer reading as if it were complete — the exact failure mode the caveat is trying to prevent, but handled before the work rather than after.

The tradeoff is a round-trip of latency/friction on the common case where the user did want the modelled answer and would just say "yes" — and it cuts against the app's general eager-compute stance ("every number must come from a tool result you just computed").

One framing question for you: this confirm-first shape is conceptually a scoped Plan-mode turn — PLAN_MODE_DIRECTIVE already encodes "don't call tools, ask first." So rather than adding a fourth bespoke rule to SCOPE_AND_REFUSAL, it might be cleaner to express partial-answer cases as "enter a Plan-mode-style turn" and reuse that machinery. Worth deciding whether this is a new rule or a mode interaction.

Not blocking — the current compute-first version is internally consistent now. Flagging it as a behavioural choice I'd like us to make deliberately, ideally pinned by an eval case (the personal-allowance → inflation flow) once we settle where the eval harness lives post-#52.

vercel Bot deployed to Preview June 8, 2026 08:41 View deployment

vahid-ahmadi requested a review from anth-volk June 8, 2026 09:28

vahid-ahmadi force-pushed the feat/system-prompt-scope-refusal branch from ba97ba0 to 98c30d1 Compare June 15, 2026 14:14

vercel Bot deployed to Preview June 15, 2026 14:15 View deployment

anth-volk reviewed Jun 15, 2026

View reviewed changes

vercel Bot deployed to Preview June 15, 2026 14:56 View deployment

This was referenced Jun 15, 2026

Add opt-in scope router to /chat/message (load the heavy background only when needed) #109

Draft

Design note: scope-aware routing (load the background only when needed) #110

Closed

Version-stamp reference.md and warn on engine drift (closes #104) #106

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add scope/refusal contract to chat system prompt (closes #101)#102

Add scope/refusal contract to chat system prompt (closes #101)#102
vahid-ahmadi wants to merge 2 commits into
mainfrom
feat/system-prompt-scope-refusal

vahid-ahmadi commented Jun 8, 2026

Uh oh!

vercel Bot commented Jun 8, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 8, 2026

Uh oh!

anth-volk Jun 15, 2026

Uh oh!

vahid-ahmadi commented Jun 15, 2026

Uh oh!

anth-volk commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vahid-ahmadi commented Jun 8, 2026

Summary

Rationale

Notes

Uh oh!

vercel Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 8, 2026

Uh oh!

anth-volk Jun 15, 2026

Choose a reason for hiding this comment

Uh oh!

vahid-ahmadi commented Jun 15, 2026

Uh oh!

anth-volk commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vercel Bot commented Jun 8, 2026 •

edited

Loading