You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
default Hippocampus consolidation to raw memory so background memory work does not consume the visible chat inference lane
keep semantic compression available only behind CONTINUUM_ENABLE_LLM_MEMORY_SYNTHESIS
move Hippocampus to the lowest background cadence, pause it during startup autonomous-work gating, and require strict background backpressure for opt-in semantic synthesis
Validation
npx vitest run tests/unit/memory/HippocampusConsolidationPolicy.test.ts tests/unit/memory/RawMemoryAdapter.test.ts tests/unit/memory/SemanticCompressionAdapter.test.ts --no-color
LGTM — small surgical fix that targets exactly the 'Compressed Insight' canned-text leak we observed when memory consolidation was sharing the visible-chat inference engine.
Verified
Default consolidation = RAW (no LLM): selectDefaultConsolidationAdapter reads getDefaultConsolidationMode() which defaults to 'raw' unless CONTINUUM_ENABLE_LLM_MEMORY_SYNTHESIS=1. RawMemoryAdapter is a real pass-through (WorkingMemoryEntry → MemoryEntity, no synthesis call) — memory still gets stored, just without the LLM step that was bleeding into chat replies.
Priority 'low' → 'lowest': doc says "Background memory must not compete with visible chat turns." Belt-and-suspenders.
Hippocampus.tick short-circuits on StartupAutonomousWorkGate.isPaused(): same gate as Stabilize startup persona backpressure #1058. Memory consolidation deferred during seed/cold-start, consistent with the autonomous-loop pattern.
SemanticCompressionAdapter BackpressureService check upgraded 'low' → 'background': when LLM synthesis IS opted in, it now uses the strictest back-off lane.
Test coverage gap: the policy function is tested but the larger behavior changes — RawMemoryAdapter selection at construction, 'lowest' priority, StartupAutonomousWorkGate.isPaused() short-circuit, 'background' backpressure level — aren't directly asserted in tests. The first one in particular is the user-visible behavior change. A simple Hippocampus constructor test (env unset → adapter is RawMemoryAdapter; env=1 → SemanticCompressionAdapter) would lock the wiring in. Worth a follow-up.
'lowest' priority value: assuming the PersonaContinuousSubprocess priority enum supports it (TS would catch otherwise — the prepush passed). Worth checking the enum has lowest distinctly below low so this isn't a no-op rename.
Documentation: when CONTINUUM_ENABLE_LLM_MEMORY_SYNTHESIS=1, what's the user-facing behavior difference? Briefly: (a) memory entries become semantic summaries instead of raw thoughts, (b) consumes inference budget that could otherwise serve chat. Worth a one-line operator note in the env var docs.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Validation