Redesign calibration loss as capped weighted MAPE by MaxGhenis · Pull Request #96 · PolicyEngine/populace

MaxGhenis · 2026-06-17T15:15:18Z

Summary

Redesigns the Populace calibration loss, replacing squared bounded relative error with a capped weighted MAPE (mean absolute percentage error):

loss = weighted_mean(min(abs((estimate - target) / scale), cap))
scale = max(abs(target), abs(initial_estimate), 1)
cap = 10  # 1000% per target

This fixes the a912 failure mode where zero-valued/tiny fiscal rows dominated the objective and prevented meaningful movement on major targets like CTC.
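A minimal sketch of the formula above in plain Python, for illustration only. The function names `target_scales` and `capped_weighted_mape` are hypothetical and are not taken from populace.calibrate. The sketch also assumes that `weighted_mean` normalizes by the sum of the weights, which the summary does not spell out. The three rows are made-up numbers, not a912 data.

```python
def target_scales(targets, initial_estimates, floor=1.0):
    """Fixed per-target scales: max(|target|, |initial_estimate|, floor)."""
    return [
        max(abs(t), abs(e0), floor)
        for t, e0 in zip(targets, initial_estimates)
    ]


def capped_weighted_mape(estimates, targets, scales, weights, cap=10.0):
    """Weighted mean of per-target scaled absolute misses, each capped at `cap`."""
    misses = [
        min(abs((e - t) / s), cap)
        for e, t, s in zip(estimates, targets, scales)
    ]
    # Assumption: the weighted mean divides by the total weight.
    return sum(w * m for w, m in zip(weights, misses)) / sum(weights)


# Illustrative rows: a zero-valued target, a tiny target, and a large CTC-like target.
targets = [0.0, 5.0, 1.0e11]
initial = [2.0e3, 50.0, 8.0e10]
weights = [1.0, 1.0, 1.0]

# Scales are computed once from the initial estimates and then held fixed.
scales = target_scales(targets, initial)
loss = capped_weighted_mape(initial, targets, scales, weights)
```

With these numbers the scaled misses are 1.0, 0.9, and 0.2, so the zero-valued and tiny rows sit on the same order of magnitude as the large row instead of dominating it. However far an estimate drifts, a single row contributes at most `cap` times its weight.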

Changes

- Adds fixed target-loss scales in populace.calibrate, defaulting to max(abs(target), abs(initial_estimate), 1) after matrix compilation.
- Caps each target's scaled miss at 1000% (cap = 10).
- Lowers the default Adam learning rate to 0.02, which is better behaved for absolute-loss gradients.
- Removes the obsolete compiled-target -1 guard from matrix compilation; the new loss has no (target + 1) denominator.
- Changes the US fiscal refresh builder from raw dollar target weighting / CTC-only boost to semantic weights:
  - national critical fiscal roles get higher priority,
  - state rows are downweighted,
  - state rows do not inherit national role boosts.
- Updates PUF aggregate diagnostics and docs to report the new loss formula.
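The semantic weighting rules in the list above could look roughly like the sketch below. Everything here is hypothetical: the PR description does not state the actual weight values, role names, or function signatures, so `NATIONAL_ROLE_BOOST`, `STATE_DOWNWEIGHT`, and `semantic_weight` are stand-ins chosen only to show the three rules.

```python
# Hypothetical values; the PR description does not give the real numbers.
NATIONAL_ROLE_BOOST = {"ctc": 4.0}  # national critical fiscal roles
DEFAULT_NATIONAL_WEIGHT = 1.0
STATE_DOWNWEIGHT = 0.25


def semantic_weight(role: str, is_state: bool) -> float:
    """Return a target weight from its role and geography, not its dollar size."""
    if is_state:
        # State rows are downweighted and never inherit a national role boost.
        return STATE_DOWNWEIGHT
    # National rows get a boost only if their role is marked critical.
    return NATIONAL_ROLE_BOOST.get(role, DEFAULT_NATIONAL_WEIGHT)
```

Under these assumed values, `semantic_weight("ctc", is_state=False)` returns the boosted weight while `semantic_weight("ctc", is_state=True)` returns the flat state weight, which is the behavior the regression coverage mentioned under Verification is meant to protect.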

Sanity check

On the published a912 diagnostics, the old objective's top 10 rows accounted for ~99.9999989% of total loss. Under the new capped MAPE scoring on the same final estimates, the top 10 rows account for ~0.6%, so loss is no longer owned by a handful of zero/tiny rows.
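The concentration check described above can be sketched as a top-k share of total loss. This is an assumed reconstruction of the diagnostic, not the script used for the PR, and the row losses below are synthetic values chosen to show the effect of the cap; they do not reproduce the a912 figures.

```python
def top_k_share(row_losses, k=10):
    """Fraction of total loss contributed by the k largest rows."""
    total = sum(row_losses)
    if total == 0:
        return 0.0
    return sum(sorted(row_losses, reverse=True)[:k]) / total


# Synthetic "old objective": ten rows with near-zero denominators blow up.
old_losses = [1.0e12] * 10 + [0.01] * 990

# Synthetic "new objective": the same ten rows hit the cap of 10,
# and the remaining rows miss by 10% of their scale.
new_losses = [10.0] * 10 + [0.1] * 990

old_share = top_k_share(old_losses)
new_share = top_k_share(new_losses)
```

Because every capped row loss is at most `cap`, the top-k rows can never exceed `k * cap` in unweighted total, so their share shrinks as the number of targets with nonzero misses grows.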

Verification

uv run python -m pytest -q passed
uv run ruff check ... passed on changed Python files
git diff --check passed
/cycle review: first pass found state rows were still getting national role boosts; fixed with regression coverage. Second pass found no actionable issues.

Redesign calibration loss as capped weighted MAPE

b9729d9

MaxGhenis merged commit 8aa3dbb into main Jun 17, 2026
4 checks passed

MaxGhenis deleted the codex/loss-redesign-20260617 branch June 17, 2026 15:15

daphnehanse11 mentioned this pull request Jun 17, 2026

Gate US fiscal refresh base population scale #93

Open

MaxGhenis merged 1 commit into main from codex/loss-redesign-20260617
