feat: tracing pipeline (schema rename + recorder + viewer + docs) by yanmxa · Pull Request #23 · genai-io/gen-code

yanmxa · 2026-05-15T10:01:34Z

Brings the rest of the tracing stack into main.

PRs #20, #21, #22 were merged into their stacked base branches (`trace/1-persistence`, `trace/2-schema-rename`, `trace/3-recorder`) rather than `main` — the stacked-PR base wasn't retargeted after #19 landed. This PR consolidates their cumulative content into main via a single merge.

What's in the diff (already reviewed)

refactor: unify record/payload naming + ContentBlock.Source field #20: `refactor: unify record/payload naming + ContentBlock.Source field` — `transcript.` → `session.` record types; `TranscriptID` → `SessionID`; `SystemRecord` → `SessionRecord`; ContentBlock.Source field; projector forward-compat (unknown patch paths ignored).
feat: trace recorder for inference / system / tools + content provenance #21: `feat: trace recorder for inference / system / tools + content provenance` — `session.Recorder` wired via `core.Config.OnEvent`; `inference.requested/responded` with digests; `system.section.added/removed` with caller; `tools.added/removed`; `` content tagged `Source="reminder"`; observer-replay-on-attach so the event chain is complete from t0. Fix: writes `session.started` before observer-driven telemetry.
feat: gen trace web viewer #22: `feat: gen trace web viewer` — localhost `gen trace` server, embedded SPA, polling-based tail + SSE. Fixes: HTML-escape session IDs; dedup records across SSE reconnects.
feat: tracing pipeline (schema rename + recorder + viewer + docs) #23: `docs: tracing event model` — `docs/tracing.md`.

Test

`go build ./...` clean
`go test ./...` — 51 packages pass, 0 FAIL

🤖 Generated with Claude Code

Naming rule: every record type follows <entity>.<verb> past tense, lowercase, dot-separated. Payload key matches the first segment of type. Renames: transcript.started/forked/.compacted (JSON value) → session.* RecordStarted/Forked/Compacted → SessionStarted/Forked/Compacted RecordMessageAppended/RecordStatePatched → MessageAppended/StatePatched SystemRecord (lifecycle payload) → SessionRecord (the struct holds provider/model/parentId/boundaryId — name now matches intent; it never carried system-prompt data) Record.System field → Record.Session Record.TranscriptID + json tag → SessionID / "sessionId" *Command.TranscriptID → SessionID ForkCommand.Source/NewTranscriptID → Source/NewSessionID ListItem.TranscriptID, fileIndexEntry.TranscriptID → SessionID ContentBlock gains a Source string field for provenance attribution (populated in a later commit). The existing inline-image data field, which used JSON tag "source", is renamed to ImageSource / "imageSource" to free the namespace. Projector's applyStatePatch now ignores unknown patch paths (default: continue) instead of returning an error — keeps the projector forward-compatible with patch paths added by later schema iterations. The error-expecting test becomes IsIgnored, asserting subsequent ops still apply.

Adds session.Recorder, a synchronous observer on core.Agent's event bus that translates lifecycle events into transcript records. Wired via core.Config.OnEvent (new BuildParams.OnEvent passthrough; constructed by Session.NewRecorder at agent start). New records per turn: inference.requested digests of system prompt / tool list / message chain inference.responded stop reason, latency, token usage System observability: Use(sec, caller), Drop(name, caller) on core.System require explicit caller strings (system:init, command:identity, subagent:init). System.SetObserver replays existing sections on attach so the event chain is complete from t0. Recorder writes system.section.added/removed. Tools observability: Same pattern on core.Tools: Add/Remove gain caller, SetObserver replays. MCP registrations pass caller="mcp:<toolname>". Both wrappers (permissionTools, progressTools) pass-through. Recorder writes tools.added (with schema) / tools.removed (with name). Content provenance: splitTextByProvenance splits user-message content on <system-reminder> boundaries and tags those blocks with Source="reminder". Round-trip safe: extractUserContent concatenates back, preserving core.Message.Content byte-for-byte. Non-blocking telemetry: agent.emitTelemetry uses select-default on the outbox so observer-fired events never deadlock the agent goroutine on a slow TUI consumer. Tests: 4 new Recorder tests + 2 provenance tests.

NewRecorder now calls FileStore.Start synchronously before returning, so session.started lands on disk first. Previously the call order 1. app builds Recorder 2. core.NewAgent wires System/Tools.SetObserver 3. SetObserver replays existing members → AppendSystemSection/AppendTools creates the transcript file 4. Store.Save → Start → file exists, no-op left provider/model/parentID metadata absent from every transcript that involved any observer replay (i.e. every main-agent session). Resume and the projects index would show empty Provider/Model. Fix lives in the recorder constructor because that's the latest point before observers can fire. RecorderOptions gains Cwd and ProjectID so Start can be called with the right metadata. Setup.NewRecorder threads them through from Store.cwd/Store.projectID. Test: TestRecorder_WritesSessionStartedBeforeTelemetry asserts that after NewRecorder() returns and an OnSystemChange event is fired, the first record on disk is session.started with provider+model populated.

A localhost-only web UI for inspecting session transcripts under ~/.gen/projects/<encoded-cwd>/transcripts/. Read-only, single binary (assets embedded via go:embed), no build step on the frontend. Backend (internal/trace): - HTTP server with three endpoints: GET / SPA shell GET /api/sessions list transcripts in the project GET /api/sessions/{id}/records paginated record fetch GET /api/sessions/{id}/stream SSE live tail - The wire format is the JSONL on disk verbatim — server doesn't reshape records. One schema, not two. - Polling-based tailer (500ms tick); no fsnotify dependency. - Path-traversal guard rejects sessionIDs containing slashes. Frontend (internal/trace/ui/assets): - Vanilla JS, ~200 lines. EventSource for SSE. - Sessions sidebar | colored timeline | JSON detail panel. - Per-group filter checkboxes (state, tools, system, inference, message). - Auto-scrolls when near the bottom; pauses on user scroll-up. CLI (cmd/gen/trace.go): - `gen trace` binds 127.0.0.1 on a random port by default, opens the browser, blocks until Ctrl-C. - --addr to pin a port; refuses any non-loopback host. - --no-open to skip the browser launch (for headless / CI). Tests cover the records endpoint, empty-project case, and the path-traversal guard. Smoke-tested locally against real session files.

Two issues in the viewer's frontend. P1 (defense-in-depth XSS): s.id from /api/sessions flows into the sidebar li's innerHTML unescaped. Today the ID is always a UUID by construction, but filesystem-derived strings should never be trusted as HTML — the title beside it is already escaped, the inconsistency suggests an oversight. Wrap s.id.slice(0, 12) in escapeHTML(). P2 (timeline duplicates on reconnect): browsers auto-reconnect EventSource on transient drops (laptop sleep, localhost blip). The server starts each stream from offset 0 and replays the entire file, so every reconnect re-emitted every record and onmessage pushed them all into state.records. After a sleep/wake the timeline doubled, tripled, etc. Fix: maintain a state.seenIDs Set keyed by each record's stable id field. Skip on duplicate. Cleared in openSession alongside records[]. The server-side replay strategy is left unchanged — a follow-up could honor Last-Event-ID to skip already-streamed bytes, but dedup-on-client is the correctness fix and lets the timeline survive reconnects today.

A first-principles description of the transcript event taxonomy, record envelope, payload shapes per group, replay algorithm, common troubleshooting recipes (jq one-liners), and the gen trace viewer's HTTP API. Companion to docs/transcriptstore.md, which covers storage layout and resume mechanics; this doc focuses on the event schema and the viewer.

yanmxa added 2 commits May 15, 2026 22:29

yanmxa force-pushed the trace/5-docs branch from 790c579 to 0ea583f Compare May 15, 2026 14:29

yanmxa force-pushed the trace/4-viewer branch from 8c8e06e to d7836dd Compare May 15, 2026 14:29

yanmxa added 2 commits May 15, 2026 23:13

yanmxa force-pushed the trace/5-docs branch from 0ea583f to 26c040d Compare May 15, 2026 15:14

yanmxa force-pushed the trace/4-viewer branch from d7836dd to dc44f17 Compare May 15, 2026 15:14

yanmxa mentioned this pull request May 15, 2026

feat: trace recorder for inference / system / tools + content provenance #21

Merged

3 tasks

yanmxa added 2 commits May 15, 2026 23:19

yanmxa force-pushed the trace/5-docs branch from 26c040d to 7d3c9ad Compare May 15, 2026 15:19

yanmxa mentioned this pull request May 15, 2026

feat: gen trace web viewer #22

Merged

2 tasks

yanmxa changed the base branch from trace/4-viewer to main May 15, 2026 15:25

yanmxa changed the title ~~docs: tracing event model~~ feat: tracing pipeline (schema rename + recorder + viewer + docs) May 15, 2026

yanmxa merged commit 6d7e5f5 into main May 15, 2026

yanmxa deleted the trace/5-docs branch May 15, 2026 15:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: tracing pipeline (schema rename + recorder + viewer + docs)#23

feat: tracing pipeline (schema rename + recorder + viewer + docs)#23
yanmxa merged 6 commits into
mainfrom
trace/5-docs

yanmxa commented May 15, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yanmxa commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's in the diff (already reviewed)

Test

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yanmxa commented May 15, 2026 •

edited

Loading