Synapse-Graph (AI Autopsy Engine)

Turn LLM internals into observable, governable infrastructure

Live Presentation

HTML Presentation — Open in browser

Scene	Description
Intro	Project overview, features, graph visualization
Architecture (Interactive)	Clickable component diagram
Tech Stack	Dependencies
Demo	Demo video
OpenMetadata	Governance plane
Status	Capabilities + gaps
Thank You	Credits

Video Demos

Complete Demo Walkthrough — 2-3 min circuit discovery → quarantine → re-run
Product Demo — Live run showing active heads, lineage, quarantine

💡 The Motivation: Breaking the Black Box

Synapse-Graph started from a simple frustration: when an LLM hallucinates, most teams are still left choosing between two bad fixes.

1. Prompt Engineering
Rewrite the system prompt to ask for better behavior and hope the next response cooperates.

2. Retraining / Fine-tuning
Spend compute and engineering time to train the bad behavior out of the model.

Most observability tools only inspect the surface area of the model: prompts, tokens, latency, and logs. Synapse-Graph treats the AI like an engine, not a black box. If a spark plug misfires, you do not replace the entire car; you identify the exact component and fix it.

Core idea: combine Mechanistic Interpretability with Enterprise Data Governance so hallucinations can be traced, explained, and corrected in real time without retraining.

⚙️ Engineering Philosophy: How It Actually Works

Synapse-Graph performs live neural surgery by operating directly on model tensors instead of wrapping the API surface.

1. 🔍 The PyTorch Shadow Tracer

The tracer uses register_forward_hook on attention modules to observe generation as it happens.

Captures attention matrices at the layer, head, and position level
Records projection outputs during inference
Keeps tracing lightweight enough to run alongside generation
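
A minimal sketch of hook-based capture, assuming a stand-in `torch.nn.MultiheadAttention` module instead of the project's real model. The names `attach_tracer` and `captured` are hypothetical and only illustrate how `register_forward_hook` can record per-head attention weights during a forward pass.

```python
import torch
from torch import nn


def attach_tracer(attn: nn.Module, layer_idx: int, store: dict):
    """Register a forward hook that saves the attention weights of one layer."""

    def hook(module, args, output):
        # nn.MultiheadAttention returns (attn_output, attn_weights).
        _attn_output, attn_weights = output
        store[layer_idx] = attn_weights.detach()

    # The returned handle lets the caller remove the hook after tracing.
    return attn.register_forward_hook(hook)


torch.manual_seed(0)
attn = nn.MultiheadAttention(embed_dim=16, num_heads=4, batch_first=True)
captured: dict[int, torch.Tensor] = {}
handle = attach_tracer(attn, layer_idx=0, store=captured)

tokens = torch.randn(1, 5, 16)  # (batch, positions, embed_dim)
with torch.no_grad():
    # average_attn_weights=False keeps one attention matrix per head.
    attn(tokens, tokens, tokens, average_attn_weights=False)
handle.remove()

weights = captured[0]  # (batch, heads, query_pos, key_pos)
```

Removing the handle after the traced call keeps later, untraced generations free of hook overhead.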

2. 🛡️ The OpenMetadata Governance Layer

We map neural structure into familiar catalog primitives so the model can be governed like enterprise data.

Neural Concept	OpenMetadata Representation
Model	Database
Transformer layers	Tables
Attention heads	Columns

That mapping turns thought paths into lineage edges, making it possible to tag specific heads as DEFECTIVE and feed that signal back into runtime control.
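
One way to picture a thought path as lineage edges is sketched below. It assumes a simple rule (link the most active head of each layer to the most active head of the next layer), which may differ from the project's actual path selection. `LineageEdge`, `head_column`, and `thought_path` are hypothetical names, and the 0-based head index is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LineageEdge:
    source: str
    target: str


def head_column(layer: int, head: int) -> str:
    """Name a head the way the catalog mapping does: table Layer_N, column Head_N."""
    return f"Layer_{layer}.Head_{head}"


def thought_path(activations: list[list[float]]) -> list[LineageEdge]:
    """Turn activations[layer][head] into edges between each layer's strongest head."""
    strongest = [max(range(len(row)), key=row.__getitem__) for row in activations]
    # Layers are numbered from 1 to match Layer_1 ... Layer_N in the lineage.
    columns = [head_column(layer, head) for layer, head in enumerate(strongest, start=1)]
    return [LineageEdge(a, b) for a, b in zip(columns, columns[1:])]


activations = [
    [0.1, 0.9, 0.2],  # layer 1: head 1 is strongest
    [0.7, 0.3, 0.1],  # layer 2: head 0 is strongest
    [0.2, 0.2, 0.8],  # layer 3: head 2 is strongest
]
edges = thought_path(activations)
```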

3. ⚡ Causal Ablation

Correlation is not causation, so the backend runs an automated ablation sweep over single heads and head pairs to isolate the circuit responsible for the behavior. With $k$ candidate heads, the pairwise sweep costs $O(k^2)$ generation re-runs.

Identify a suspicious head
Zero its projection output and re-run the generation
Measure whether the hallucination rate drops
Promote the confirmed culprit to DEFECTIVE in OpenMetadata and mask it in the FastAPI proxy on future requests
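
The loop above can be sketched as follows, assuming a caller-supplied `score_fn` that re-runs generation with a given set of heads masked and returns a hallucination score. The function `ablation_sweep`, the toy scoring rule, and the integer percentages are illustrative assumptions, not the project's implementation.

```python
from itertools import combinations

Head = tuple[int, int]  # (layer, head)


def ablation_sweep(candidates: list[Head], score_fn, max_pair_sweeps: int):
    """Mask single heads, then head pairs, and report the most effective mask."""
    baseline = score_fn(frozenset())
    results: dict[frozenset, int] = {}
    for head in candidates:  # single-head ablations
        results[frozenset([head])] = score_fn(frozenset([head]))
    for i, pair in enumerate(combinations(candidates, 2)):  # pairwise ablations
        if i >= max_pair_sweeps:
            break
        results[frozenset(pair)] = score_fn(frozenset(pair))
    best = min(results, key=results.get)
    return best, baseline - results[best], len(results)


def toy_score(masked: frozenset) -> int:
    """Stand-in for a real re-run: hallucination rate in percent."""
    score = 90
    if (3, 7) in masked:
        score -= 10  # one head contributes a little on its own
    if {(3, 7), (5, 2)} <= masked:
        score -= 50  # the circuit only breaks when both heads are masked
    return score


culprit, drop, runs = ablation_sweep([(3, 7), (5, 2), (1, 0)], toy_score, max_pair_sweeps=3)
```

In this toy setup no single head explains the behavior, which is the case the pairwise sweep exists to catch.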

The result is a debugging loop that is:

Targeted rather than speculative
Deterministic rather than anecdotal
Cheap to operate because it avoids retraining

The Problem

LLMs are powerful but opaque. Current observability tools stop at prompts, tokens, latency, and logs. They don't answer:

Which layers and heads were most active for this response?
Can we trace a "thought path" through the network?
Can governance tools intervene on specific neural components?

The Solution

Synapse-Graph repurposes OpenMetadata as a governance and lineage system for transformer internals:

Model → Database
Transformer layers → Tables
Attention heads → Columns
High-activation paths → Lineage edges
DEFECTIVE tag → Runtime control signal that masks a head during the next generation
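
A rough sketch of how a DEFECTIVE tag could become a runtime mask. `HeadMaskStore` is named in the architecture diagram, but the methods, the column-name pattern, and the input shape used here (a mapping from column name to its tags) are assumptions made for illustration.

```python
import re
from dataclasses import dataclass, field

# Assumes column names end in "Layer_<n>.Head_<m>".
_COLUMN_RE = re.compile(r"Layer_(\d+)\.Head_(\d+)$")


@dataclass
class HeadMaskStore:
    masked: set[tuple[int, int]] = field(default_factory=set)

    def sync_defects(self, tagged_columns: dict[str, list[str]]) -> int:
        """Replace the mask set with every column tagged DEFECTIVE."""
        self.masked = {
            (int(m.group(1)), int(m.group(2)))
            for name, tags in tagged_columns.items()
            if "DEFECTIVE" in tags and (m := _COLUMN_RE.search(name))
        }
        return len(self.masked)

    def is_masked(self, layer: int, head: int) -> bool:
        return (layer, head) in self.masked


store = HeadMaskStore()
count = store.sync_defects(
    {
        "Transformer_Graph.Layer_3.Head_7": ["DEFECTIVE"],
        "Transformer_Graph.Layer_3.Head_8": [],
        "Transformer_Graph.Prompt_Ingress": ["DEFECTIVE"],  # not a head, ignored
    }
)
```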

The Impact

Turns model internals into inspectable infrastructure with familiar data-platform primitives.

Architecture

flowchart LR
    subgraph Dashboard["🎨 Operator Dashboard"]
        D["Next.js<br/>React<br/>@xyflow/react"]
    end

    subgraph Proxy["⚡ Neural Proxy (FastAPI)"]
        P["Generation + Tracing<br/>Governance + SSE<br/>HeadMaskStore"]
    end

    subgraph Generation["🔥 Generation"]
        O["Ollama<br/>(Preferred)"]
    end

    subgraph Tracing["🔍 Tracing"]
        T["HF Tracer<br/>PyTorch hooks"]
    end

    subgraph Governance["🛡️ Governance"]
        OM["OpenMetadata<br/>Topology + Lineage<br/>Tags → Masks"]
        DEF["⛔ DEFECTIVE<br/>→ Runtime Mask"]
    end

    D -->|"REST + SSE"| P
    P -->|"Generation"| O
    P -->|"Tracing"| T
    P -->|"Topology<br/>Lineage<br/>Tags"| OM
    OM -->|"tag"| DEF

Interactive diagram: Click here for full interactive architecture

Backend Details

`backend/app/main.py` — FastAPI Application

REST Endpoints:

Endpoint	Method	Purpose
`/api/v1/state`	GET	Current runtime state
`/api/v1/generate`	POST	Full generation response
`/api/v1/generate/stream`	POST	SSE with trace steps
`/api/v1/autopsy/discover_circuit`	POST	Circuit discovery
`/api/v1/autopsy/discover_circuit/stream`	POST	SSE discovery progress
`/api/v1/autopsy/causal`	POST	Causal autopsy
`/api/v1/openmetadata/bootstrap`	POST	Bootstrap catalog
`/api/v1/openmetadata/sync-defects`	POST	Sync tags to masks
`/api/v1/openmetadata/quarantine`	POST	Quarantine heads
`/api/v1/webhooks/openmetadata`	POST	Webhook handler
`/api/v1/governance/local-mask`	POST	Set head mask
`/api/v1/governance/clear-local-masks`	POST	Clear masks
`/api/v1/hf/preload`	POST	Load HF tracer
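
For the SSE endpoints, a client has to split the event stream into individual events. The sketch below parses the standard `data:` framing of Server-Sent Events. The JSON payload fields (`step`, `layer`, `head`) are hypothetical, since the actual event schema is not documented here.

```python
import json


def parse_sse(stream_text: str) -> list[dict]:
    """Collect JSON payloads from `data:` lines; a blank line ends each event."""
    events: list[dict] = []
    data_lines: list[str] = []
    for line in stream_text.splitlines():
        if line.startswith("data:"):
            # The SSE format allows one optional space after the colon.
            data_lines.append(line[5:].removeprefix(" "))
        elif line == "" and data_lines:
            events.append(json.loads("\n".join(data_lines)))
            data_lines = []
        # Comment lines (starting with ":") and other fields are ignored.
    return events


sample = (
    'data: {"step": 1, "layer": 0, "head": 3}\n\n'
    ": keep-alive\n\n"
    'data: {"step": 2, "layer": 1, "head": 5}\n\n'
)
events = parse_sse(sample)
```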

Execution Modes:

AUTO — Prefer Ollama if available
FAST — Ollama + parallel HF tracing
FAITHFUL — Only HF with inline tracing
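
The mode selection can be sketched as below. It assumes AUTO collapses to FAST when Ollama is reachable and to FAITHFUL otherwise, which is one plausible reading of "prefer Ollama if available". `ExecutionMode`, `resolve_mode`, `backends_for`, and the backend labels are hypothetical names.

```python
from enum import Enum


class ExecutionMode(str, Enum):
    AUTO = "AUTO"
    FAST = "FAST"
    FAITHFUL = "FAITHFUL"


def resolve_mode(requested: ExecutionMode, ollama_available: bool) -> ExecutionMode:
    """Collapse AUTO into a concrete mode based on Ollama availability."""
    if requested is ExecutionMode.AUTO:
        return ExecutionMode.FAST if ollama_available else ExecutionMode.FAITHFUL
    return requested


def backends_for(mode: ExecutionMode) -> dict[str, str]:
    """Map a concrete mode to its generator and tracer placement."""
    if mode is ExecutionMode.FAST:
        return {"generator": "ollama", "tracer": "hf-parallel"}
    if mode is ExecutionMode.FAITHFUL:
        return {"generator": "hf", "tracer": "hf-inline"}
    raise ValueError("resolve AUTO before selecting backends")


plan = backends_for(resolve_mode(ExecutionMode.AUTO, ollama_available=False))
```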

Hook-Based Attention Capture:

def _register_attention_hooks(model, layer_idx, hook_handles):
    """Attach register_forward_hook to the attention module of one layer.

    Captures attention_weights and the projection output, storing the
    returned hook handles in hook_handles.
    """
    ...


def _make_projection_mask_hook(layer_idx, head_idx):
    """Build a hook that applies masking to the output projection.

    Zeroes the masked head's hidden states.
    """
    ...

Two-Level Masking:

Attention tensor masking
Projection masking (hidden states)
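
The second level, projection masking, can be illustrated with a pre-hook on a stand-in output projection. This sketch assumes the projection input is the concatenation of per-head outputs along the last dimension, as in standard multi-head attention. `make_projection_mask_pre_hook` is a hypothetical helper, not the project's `_make_projection_mask_hook`.

```python
import torch
from torch import nn


def make_projection_mask_pre_hook(head_idx: int, head_dim: int):
    """Build a pre-hook that zeroes one head's slice before the output projection."""
    start, stop = head_idx * head_dim, (head_idx + 1) * head_dim

    def pre_hook(module: nn.Module, args: tuple):
        (hidden,) = args
        hidden = hidden.clone()  # avoid mutating the caller's tensor
        hidden[..., start:stop] = 0.0
        return (hidden,)  # returned tuple replaces the module's inputs

    return pre_hook


torch.manual_seed(0)
num_heads, head_dim = 4, 8
out_proj = nn.Linear(num_heads * head_dim, num_heads * head_dim)
concat_heads = torch.randn(2, 3, num_heads * head_dim)  # (batch, positions, heads * head_dim)

with torch.no_grad():
    baseline = out_proj(concat_heads)
    handle = out_proj.register_forward_pre_hook(make_projection_mask_pre_hook(1, head_dim))
    masked = out_proj(concat_heads)  # head 1 contributes nothing
    handle.remove()
    restored = out_proj(concat_heads)  # the mask is gone once the hook is removed
```

Masking before the projection removes the head's contribution to the residual stream without touching the model weights, so the change is fully reversible.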

Default Models:

Ollama: qwen2.5:3b-instruct
HuggingFace: Qwen/Qwen2.5-1.5B-Instruct
Dashboard default: gpt2 (12 layers × 12 heads = 144 heads)

OpenMetadata Topology

erDiagram
    Service ||--o{ Database : "Synapse_Neural_Service"
    Database ||--o{ Schema : "Transformer_Graph"
    Schema ||--o{ Table : "Layer_N (per layer)"
    Schema ||--o| Table : "Prompt_Ingress"
    Schema ||--o| Table : "Response_Egress"
    Table ||--o{ Column : "Head_N (per head)"

    Service {
        string name "Synapse_Neural_Service"
        string type "mysql"
    }

    Column {
        string name "Head_N"
        string type "FLOAT"
        string tag "DEFECTIVE/QUARANTINED"
    }

Classification & Tags:

Classification: SynapseQuarantine
Tag: DEFECTIVE (color: #39FF14)

Lineage: Prompt_Ingress → Layer_1 → ... → Layer_N → Response_Egress
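
To make the topology concrete, the sketch below enumerates the tables, columns, and lineage chain for a 12-layer, 12-head model such as the dashboard default. The `build_topology` helper is hypothetical, and the database name (`gpt2` here), the 1-based layer numbering, and the 0-based head numbering are assumptions.

```python
def build_topology(
    n_layers: int,
    n_heads: int,
    service: str = "Synapse_Neural_Service",
    database: str = "gpt2",  # assumed: one database per model
    schema: str = "Transformer_Graph",
):
    """Return ({table_fqn: [column names]}, lineage chain) for one model."""
    prefix = f"{service}.{database}.{schema}"
    layer_names = [f"Layer_{layer}" for layer in range(1, n_layers + 1)]
    tables = {
        f"{prefix}.{name}": [f"Head_{head}" for head in range(n_heads)]
        for name in layer_names
    }
    lineage = ["Prompt_Ingress", *layer_names, "Response_Egress"]
    return tables, lineage


tables, lineage = build_topology(n_layers=12, n_heads=12)
total_heads = sum(len(columns) for columns in tables.values())
```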

Frontend Details

Dashboard Components

frontend/components/synapse-dashboard.tsx — Main dashboard with metrics, discovery panel, governance controls
frontend/components/synapse-graph.tsx — @xyflow/react graph visualization
frontend/components/activation-chart.tsx — Per-layer, per-head activation charts
frontend/components/console-log.tsx — Real-time log stream display

Dashboard Features

Metric Cards:

Generation Backend (Ollama live / HF inline)
Trace Fidelity (Exact / Proxy evidence)
Lineage Depth (active hops)
Masked Heads count

Causal Discovery Panel:

Target token input (hallucination to remove)
top_k_heads slider (1-20)
max_pair_sweeps slider (0-190)
Run Discovery → View Overlay → Quarantine buttons
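
The slider bounds appear consistent with a pairwise sweep over the selected heads: with at most 20 candidate heads there are 20 × 19 / 2 = 190 unordered pairs. This is an inference from the numbers, not a documented rule, and `pair_sweep_limit` is a hypothetical helper.

```python
from math import comb


def pair_sweep_limit(top_k_heads: int) -> int:
    """Number of distinct unordered head pairs among the top-k candidates."""
    return comb(top_k_heads, 2)


pairs_at_slider_max = pair_sweep_limit(20)
pairs_at_slider_min = pair_sweep_limit(1)
```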

Governance Panel:

Quarantine Top Head
Clear Local Masks
Sync Defects button

Tech Stack

Backend (`backend/pyproject.toml`)

[project]
requires-python = ">=3.11,<3.13"

dependencies = [
    "fastapi>=0.115.0",
    "torch>=2.4.0",
    "transformers>=4.46.0",
    "openmetadata-ingestion>=1.12.0",
    "httpx>=0.28.0",
    "pydantic-settings>=2.7.0",
    "uvicorn[standard]>=0.32.0",
    "accelerate>=1.1.0",
    "cachetools>=5.3.0",
]

Frontend (`frontend/package.json`)

{
  "dependencies": {
    "next": "^15.2.0",
    "react": "^19.0.0",
    "@xyflow/react": "^12.4.4",
    "recharts": "^2.15.0",
    "lucide-react": "^0.468.0"
  },
  "devDependencies": {
    "tailwindcss": "^3.4.16",
    "typescript": "^5.7.2"
  }
}

Quickstart

# Backend
python3.11 -m venv .venv && source .venv/bin/activate
pip install -e ./backend
cp backend/.env.example backend/.env
cd backend && python -m uvicorn app.main:app --reload --port 8000

# Frontend (new terminal)
cd frontend && npm install && npm run dev

Dashboard: http://localhost:3000

Demo Workflow

flowchart TD
    A["1. Start<br/>Boot dashboard"] --> B["2. Trace<br/>Submit prompt"]
    B --> C["3. Discover<br/>Run circuit discovery"]
    C --> D["4. Quarantine<br/>Push DEFECTIVE tags"]
    D --> E["5. Verify<br/>Re-run prompt<br/>Show masked heads"]
    E --> A

Start — Boot dashboard, verify "Ollama live" or "HF fallback"
Trace — Submit prompt → watch synapse graph light up
Discover — Enter hallucination token → run circuit discovery
Quarantine — Click "Quarantine" → push DEFECTIVE tags to OpenMetadata
Verify — Re-run prompt → confirm the masked heads count has increased

Repository Layout

Synapse-Graph/
├── backend/
│   ├── app/
│   │   ├── main.py          # FastAPI + endpoints
│   │   ├── inference.py     # Generation + tracing
│   │   └── om_client.py    # OpenMetadata client
│   └── tests/
│       ├── test_quarantine.py
│       └── test_discover_quarantine_integration.py
├── frontend/
│   ├── app/                 # Next.js app router
│   ├── components/         # Dashboard, graph, charts
│   └── lib/               # API client
├── architecture.html       # Interactive architecture diagram
└── first_frame.html        # GitHub Pages presentation

License

MIT

Connect

Vicky Kumar

algsoch

npdimagine@gmail.com · +91 8383848219

Project: GitHub Repo · Live Demo · YouTube Demo

Built for AI Interpretability
