Skip to content
#

logit-lens

Here are 9 public repositories matching this topic...

Language: All
Filter by language

Open-source EU AI Act Annex IV compliance toolkit. Mechanistic interpretability + circuit discovery for transformers. One function call generates a court-ready evidence package

  • Updated Jun 1, 2026
  • Python

We optimize a compact latent state (frozen weights) to force failed multi-hop chains to output the missing answer D. 5 pre-registered controls show it simply injects D: carries it without the code-fact, leaves intermediates invisible, inert to hop corruption, and doesn’t transfer. No latent composition at 3B (Llama-3.2-3B, Qwen2.5-3B).

  • Updated Jun 4, 2026
  • Python

Improve this page

Add a description, image, and links to the logit-lens topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the logit-lens topic, visit your repo's landing page and select "manage topics."

Learn more