# NoldoMem — Long-Term Memory for OpenClaw AI Agents


Long-term memory for OpenClaw AI agents. Named after the Noldor — Tolkien's elves renowned for deep knowledge and craft.

NoldoMem replaces OpenClaw's built-in memory with a persistent, decay-aware memory system. Agents remember important things, forget trivial things over time, and learn from their mistakes — just like real memory.

Built for OpenClaw. One SQLite file per agent, no cloud DB, no Docker required.

## Why NoldoMem?

OpenClaw's native memory (`memorySearch`) is basic — keyword-only search, no decay, no behavioral learning. NoldoMem adds:

| Feature | OpenClaw Native | NoldoMem |
|---|---|---|
| Search | Keyword only | Hybrid (semantic + BM25 + reranker) |
| Memory decay | No | Ebbinghaus forgetting curve |
| Learn from mistakes | No | Lesson memories with behavioral reinforcement |
| Per-agent isolation | No | Separate SQLite per agent |
| Memory consolidation | No | Auto-compress old memories |
| Prompt injection protection | No | Built-in sanitization |
| Trust/provenance tracking | No | Source + trust_level per memory |
| Pattern-to-policy | No | 3+ same mistake auto-escalates to rule |
| External dependencies | None | Embedding API (cloud or local) |
| Embedding options | Cloud only | Cloud (OpenRouter/OpenAI) or self-hosted |
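The decay model referenced above is the Ebbinghaus forgetting curve. A minimal sketch of the idea (the stability constants below are illustrative, not NoldoMem's actual values — this README only states that lessons decay 3x slower than facts):

```python
import math

def retention(age_days, stability_days=30.0):
    """Ebbinghaus-style retention R = exp(-t / S); a larger stability S
    means slower forgetting. Constants here are illustrative only."""
    return math.exp(-age_days / stability_days)

# A "lesson" with ~3x the stability of a "fact" retains far more
# strength at the same age.
fact_strength = retention(30, stability_days=30.0)    # ~0.37
lesson_strength = retention(30, stability_days=90.0)  # ~0.72
```

Memories whose retention falls low enough are candidates for consolidation or forgetting, unless pinned.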

## Quick Start (OpenClaw)

### Step 1: Install NoldoMem

Requirements: Python 3.10+, ~200MB RAM for the API server (the embedding server needs more — see Step 2).

```shell
git clone https://github.com/dorukardahan/noldo-memory.git
cd noldo-memory

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

cp .env.example .env
# Edit .env: set AGENT_MEMORY_API_KEY (pick any strong secret)
```

### Step 2: Set up embeddings

NoldoMem needs an embedding API (OpenAI-compatible /v1/embeddings format).

Fastest path — use a cloud API (a few lines in .env):

```shell
# In .env:
OPENROUTER_BASE_URL=https://openrouter.ai/api/v1
OPENROUTER_API_KEY=your-key
AGENT_MEMORY_MODEL=openai/text-embedding-3-large
AGENT_MEMORY_DIMENSIONS=3072
```

Works with OpenRouter, OpenAI, or any OpenAI-compatible embedding API.

Or run locally (recommended for privacy/cost):

```shell
./scripts/detect-hardware.sh --apply  # auto-detect best model for your hardware
```

| Profile | Model | Download Size | RAM | Best For |
|---|---|---|---|---|
| minimal | EmbeddingGemma 300M | ~300MB | 1-2GB | Raspberry Pi, $5 VPS |
| light | Qwen3-Embedding-0.6B | ~600MB | 2-4GB | Small VPS |
| standard | Qwen3-Embedding-4B | ~4GB | 4-8GB | Mid-range server |
| heavy | Qwen3-Embedding-8B | ~8GB | 12GB+ | Dedicated server |

```shell
# Download model (example: standard profile)
huggingface-cli download Qwen/Qwen3-Embedding-4B-GGUF Qwen3-Embedding-4B-Q8_0.gguf --local-dir models/

# Start embedding server
llama-server --model models/Qwen3-Embedding-4B-Q8_0.gguf \
  --embedding --pooling last --host 127.0.0.1 --port 8090

# In .env:
# OPENROUTER_BASE_URL=http://127.0.0.1:8090/v1
# AGENT_MEMORY_DIMENSIONS=2560
```

Optional hosted reranker for better top-k ordering without loading a local cross-encoder model:

```shell
# In .env:
AGENT_MEMORY_RERANKER_API_ENABLED=true
AGENT_MEMORY_RERANKER_API_MODEL=cohere/rerank-4-pro
AGENT_MEMORY_RERANKER_API_KEY_FILE=$HOME/.openrouter_key
# Optional. Disabled by default because local fallback can be too slow for
# interactive recall on CPU-only hosts.
AGENT_MEMORY_RERANKER_API_LOCAL_FALLBACK=false
```

Behavior with the hosted reranker:

- When the hosted reranker is available, NoldoMem skips local reranker prewarm and two-pass background reranking.
- If the key or endpoint is missing, it falls back to the local cross-encoder path at startup.
- If the hosted call fails at runtime, NoldoMem uses a fast lexical fallback unless AGENT_MEMORY_RERANKER_API_LOCAL_FALLBACK=true is set.
- Cross-agent recall (agent=all) reranks once after merging per-agent results instead of calling the hosted reranker once per agent database.
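The fallback order described above can be sketched roughly as follows. The three strategy functions are illustrative stand-ins, not NoldoMem's actual implementation:

```python
def hosted_rerank(query, candidates, cfg):
    # Stand-in for the hosted reranker API call.
    if not cfg.get("api_reachable", True):
        raise RuntimeError("hosted reranker unreachable")
    def overlap(c):
        return len(set(query.split()) & set(c.split()))
    return sorted(candidates, key=overlap, reverse=True)

def local_cross_encoder_rerank(query, candidates):
    # Stand-in for the local cross-encoder path.
    return list(candidates)

def lexical_rerank(query, candidates):
    # Stand-in for the fast lexical fallback.
    return list(candidates)

def rerank(query, candidates, cfg):
    """Choose a reranking path following the documented fallback order."""
    if cfg.get("api_enabled") and cfg.get("api_key"):
        try:
            return hosted_rerank(query, candidates, cfg)
        except Exception:
            # Hosted call failed at runtime.
            if cfg.get("local_fallback"):
                return local_cross_encoder_rerank(query, candidates)
            return lexical_rerank(query, candidates)
    # Key or endpoint missing: local cross-encoder path.
    return local_cross_encoder_rerank(query, candidates)
```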

### Step 3: Start NoldoMem

```shell
set -a; source .env; set +a
python -m agent_memory
# API starts on http://127.0.0.1:8787
# Data stored in: ~/.agent-memory/ (or ~/.noldomem/ or legacy ~/.asuman/)
# Each agent gets its own SQLite file: memory.sqlite, memory-agent1.sqlite, etc.
```

### Step 4: Configure OpenClaw

4a. Disable OpenClaw's built-in memory (important!):

```json
{
  "agents": {
    "defaults": {
      "memorySearch": {
        "enabled": false
      }
    }
  }
}
```

4b. Enable hooks:

```json
{
  "hooks": {
    "internal": {
      "enabled": true
    }
  }
}
```

4c. Install hooks:

```shell
openclaw hooks install -l "$(pwd)/hooks"
```

This keeps the repo as the source of truth via `hooks.internal.load.extraDirs` and avoids the drift that came from copying handlers into ~/.openclaw/workspace/hooks/. The hook pack ships installable sanitized handler.js files, while keeping mirrored handler.js.example references for manual workflows.

A manual fallback workflow is still documented in hooks/README.md.

4d. Install the native OpenClaw plugin:

```shell
openclaw plugins install -l "$(pwd)/plugin"
```

The plugin gives agents explicit `noldomem_recall`, `noldomem_store`, and `noldomem_pin` tools. The hook pack remains responsible for lifecycle capture and bootstrap context injection. See plugin/README.md.

4e. Set the API key for hooks and plugin:

```shell
mkdir -p ~/.noldomem
echo "your-api-key-here" > ~/.noldomem/memory-api-key
chmod 600 ~/.noldomem/memory-api-key
```

Optional multi-workspace session discovery for maintenance/ingest utilities:

```shell
# In .env
AGENT_MEMORY_SESSIONS_ROOT="$HOME/.openclaw/agents"
```

4f. Restart OpenClaw to load the hooks and plugin.

Optional per-workspace policy file at workspace/.openclaw/noldo-memory.json:

```json
{
  "crossWorkspaceRecall": false,
  "sharedNamespaces": [],
  "dailyNotesEnabled": true
}
```

### Step 5: Verify

```shell
curl -s localhost:8787/v1/health
# {"status":"ok","checks":{"storage":true,"embedding":true}}

curl -X POST localhost:8787/v1/store \
  -H "Content-Type: application/json" -H "X-API-Key: YOUR_KEY" \
  -d '{"text": "Test memory from setup", "agent": "main"}'

curl -X POST localhost:8787/v1/recall \
  -H "Content-Type: application/json" -H "X-API-Key: YOUR_KEY" \
  -d '{"query": "test", "agent": "main", "limit": 5}'
```

## LLM Agent Integration

Put this in your agent's TOOLS.md or system prompt so the agent knows how to use NoldoMem:

```markdown
## Memory API (NoldoMem)

You have access to a persistent memory system at localhost:8787.
All requests need headers:
- Content-Type: application/json
- X-API-Key: <key> (read from ~/.noldomem/memory-api-key)

### Store a memory
POST /v1/store {"text": "...", "agent": "YOUR_AGENT_ID"}
- Auto-classified as: fact, preference, rule, conversation, or lesson

### Recall memories
POST /v1/recall {"query": "...", "agent": "YOUR_AGENT_ID", "limit": 5}
- Returns relevant memories ranked by relevance + recency + importance
- Filter by type: {"memory_type": "lesson"} for lessons only

### Store a rule (max importance)
POST /v1/rule {"text": "Always run tests before commit", "agent": "YOUR_AGENT_ID"}

### What happens automatically (no action needed)
- Old memories fade over time (Ebbinghaus decay) — use them or lose them
- Lessons decay 3x slower than facts
- Repeated mistakes (3+) become permanent rules
- Your session starts with relevant memories pre-loaded (bootstrap hook)
- Feedback you give ("wrong", "don't do that") is captured as lessons

### Error responses
- 401: Invalid API key. Read key from ~/.noldomem/memory-api-key
- 404: Unknown endpoint. Check URL
- 422: Invalid request body. Check required fields (text, agent)
- 500: Server error (usually embedding server down). Retry in 5 seconds, max 2 retries
```
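Outside the prompt, a client obeying that retry policy (retry 500s after 5 seconds, at most 2 retries) can be sketched like this. The `post` callable is an assumption standing in for a real HTTP call to http://localhost:8787/v1/recall with the headers above:

```python
import time

def recall_with_retry(post, payload, retries=2, wait=5.0):
    """POST /v1/recall, retrying only on 500 responses.

    `post` is any callable returning (status_code, body); in real use it
    would wrap urllib or requests with Content-Type and X-API-Key headers.
    """
    for attempt in range(retries + 1):
        status, body = post(payload)
        if status != 500:
            return status, body
        if attempt < retries:
            # 500 usually means the embedding server is down; wait and retry.
            time.sleep(wait)
    return status, body
```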

## How Hooks Work

NoldoMem connects to OpenClaw through a native plugin plus 10 lifecycle hooks:

- The native plugin exposes agent tools: `noldomem_recall`, `noldomem_store`, `noldomem_pin`.
- The hook pack handles lifecycle capture, bootstrap recall, compaction snapshots, and session transitions.

| Hook | When | What It Does |
|---|---|---|
| bootstrap-context | Session start | Recalls relevant memories + lessons, injects into agent context |
| realtime-capture | During chat | Detects feedback/corrections, stores as lessons |
| session-end-capture | Session end | Detects unverified suggestions, auto-generates lessons |
| after-tool-call | After tool use | Captures command outputs (allowlist-filtered) |
| before-compaction | Before compaction | Captures high-signal context before it is summarized away |
| pre-session-save | Before save | Tags session with memory metadata |
| post-compaction-restore | After compaction | Re-injects critical memories lost in context compaction |
| subagent-complete | Sub-agent done | Captures sub-agent results |
| claim-scanner | After replies | Logs unverified feature/config claims for audit |
| message-recall | User message | Optional mid-conversation recall hook |

Each hook has a HOOK.md in hooks/ explaining its behavior and configuration.

## Data Classification

NoldoMem uses two complementary classification fields:

| Field | Purpose | Values | Set by |
|---|---|---|---|
| category | Who wrote it / where it came from | user, assistant, qa_pair, decision, lesson, rule, other | Hook at capture time |
| memory_type | Canonical semantic type | fact, preference, rule, conversation, lesson, other | API auto-classifier |

`category` reflects the message origin or operational label — a user message, an assistant response, a paired Q&A exchange, or a hook-provided label such as decision.

`memory_type` stays intentionally small so API validation, DB filters, and search ranking never drift apart. Operational concepts such as incidents, deployments, config changes, and decisions should remain in `category`, `source`, `namespace`, or the memory text itself, not in `memory_type`.

Both fields are used independently. `memory_type` drives type-specific bonuses in hybrid search (e.g., lessons get a +0.35/60 RRF boost). `category` is available for reporting and operational labeling but does not expand the public `memory_type` enum.

## Architecture

```mermaid
graph TD
    A[OpenClaw Agent] <-->|lifecycle events| B(Hooks)
    B -->|store/recall HTTP| C{NoldoMem API :8787}
    C -->|read/write| D[(SQLite per agent)]
    C -->|embed text| E{Embedding Server :8090}
    B -->|bootstrap-context| A
    B -->|realtime-capture| C
    B -->|session-end-capture| C
```

```
Agent Session
    |
    +-- bootstrap-context hook -----> NoldoMem /v1/recall --> inject memories into context
    |
    +-- [conversation happens] -----> realtime-capture hook --> /v1/store (lessons)
    |
    +-- [tool calls] ---------------> after-tool-call hook --> /v1/store (outputs)
    |
    +-- session-end-capture hook ---> /v1/store (unverified suggestions as lessons)
    |
    +-- pre-session-save hook ------> tag session metadata
```

## API Reference

| Endpoint | Method | Auth | Description |
|---|---|---|---|
| /v1/health | GET | No | Health check |
| /v1/health/deep | GET | Yes | DB integrity, embedding, disk |
| /v1/store | POST | Yes | Store a memory |
| /v1/recall | POST | Yes | Hybrid search |
| /v1/capture | POST | Yes | Batch ingest (max 200 messages) |
| /v1/rule | POST | Yes | Store rule (importance=1.0) |
| /v1/forget | DELETE | Yes | Soft-delete |
| /v1/pin | POST | Yes | Pin (protect from decay) |
| /v1/unpin | POST | Yes | Unpin |
| /v1/decay | POST | Yes | Run Ebbinghaus decay |
| /v1/consolidate | POST | Yes | Deduplicate + archive |
| /v1/compress | POST | Yes | Summarize old memories |
| /v1/gc | POST | Yes | Purge soft-deleted |
| /v1/amnesia-check | POST | Yes | Check memory coverage |
| /v1/stats | GET | Yes | DB statistics |
| /v1/agents | GET | Yes | List agent DBs |
| /v1/metrics | GET | Yes | Operational metrics |
| /v1/metrics/lessons | GET | Yes | Lesson effectiveness |
| /v1/export | GET | Yes | Export as JSON |
| /v1/import | POST | Yes | Import (max 500) |
| /v1/admin/rotate-key | POST | Admin | Rotate API key |

All endpoints accept ?agent=<id> for per-agent routing. POST /v1/capture and POST /v1/store also support namespace in the JSON body (recommended for session-scoped memory isolation).

## Search Architecture

```
Query -> Semantic   (0.50) -> sqlite-vec cosine KNN
      -> Keyword    (0.25) -> FTS5 BM25
      -> Recency    (0.10) -> exp(-0.01 * days)
      -> Strength   (0.07) -> Ebbinghaus retention
      -> Importance (0.08) -> write-time score
      |
      RRF fusion (k=60) -> Primary reranker (top-10) -> Background reranker (top-3)
```

Reranking has two modes:

- Local cross-encoder: default, uses sentence-transformers models.
- API reranker: set AGENT_MEMORY_RERANKER_API_ENABLED=true and provide AGENT_MEMORY_RERANKER_API_KEY or AGENT_MEMORY_RERANKER_API_KEY_FILE. API reranking handles the primary pass and can fall back to the local cross-encoder at runtime if the hosted call fails.
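The fusion stage can be sketched as weighted reciprocal-rank fusion. This is illustrative only — the weights and k=60 come from the diagram, but the exact combination NoldoMem uses may differ:

```python
import math

# Signal weights from the search diagram.
WEIGHTS = {"semantic": 0.50, "keyword": 0.25, "recency": 0.10,
           "strength": 0.07, "importance": 0.08}

def rrf_fused_score(ranks, weights=WEIGHTS, k=60):
    """Weighted reciprocal-rank fusion: each signal contributes
    weight / (k + rank). `ranks` maps signal name -> 1-based rank of a
    candidate in that signal's result list; missing signals add nothing."""
    return sum(w / (k + ranks[s]) for s, w in weights.items() if s in ranks)

def recency_score(age_days):
    # The recency signal as written in the diagram.
    return math.exp(-0.01 * age_days)
```

A candidate ranked highly by several signals beats one ranked highly by a single signal, which is the point of fusing ranks rather than raw scores.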

## Production Deployment

### systemd

```shell
sudo cp noldo-memory.service.example /etc/systemd/system/noldo-memory.service
# Edit: paths, User, EnvironmentFile
sudo systemctl enable --now noldo-memory
```

### Cron (recommended)

See crontab.example. Key jobs: daily decay, weekly consolidation + GC, 6-hourly embedding backfill, daily SQLite backup.

### Security checklist

- AGENT_MEMORY_HOST=127.0.0.1 (never 0.0.0.0)
- API key file: permissions 600
- Data directory: permissions 700
- memorySearch.enabled: false in openclaw.json
- hooks.internal.enabled: true in openclaw.json
- Embedding server localhost only
- Backup cron active

### Docker

```shell
cp .env.example .env && mkdir -p models
# Download embedding model into models/
docker compose up -d
```

## Configuration

All config via environment variables. See .env.example for the full list.

## Tests

```shell
pip install -r requirements-dev.txt
python -m pytest tests/ -v
ruff check agent_memory/
```

## Memory Quality Audit

Run a read-only SQLite audit without printing memory text:

```shell
python scripts/audit_memory_quality.py --db ~/.agent-memory/memory.sqlite
python scripts/audit_memory_quality.py --json
```

The audit reports aggregate counts for vectorless rows, invalid memory types, duplicate text groups, very long/short rows, namespace distribution, and secret-like patterns. It prints hashed row identifiers only, never memory content.

## Secret Value Policy

NoldoMem should remember where credentials live and what they are used for, not the raw credential value. This keeps recall useful while reducing the chance of a token leaking into chat, logs, exports, or backups.

Audit first:

```shell
python scripts/audit_memory_quality.py --json
python scripts/redact_memory_secrets.py --db ~/.agent-memory/memory.sqlite --json
```

Apply redaction only after reviewing the aggregate dry-run counts:

```shell
python scripts/redact_memory_secrets.py --db ~/.agent-memory/memory.sqlite --apply
python scripts/backfill_vectors.py --agent all --batch-size 2 --max-sub-batch 1
```

The redaction script creates a SQLite backup before applying changes. It replaces secret-like values with placeholders, redacts original_text when present, invalidates vectors for changed searchable text, and never prints memory content or secret values.
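The placeholder-substitution idea looks roughly like this. The patterns are illustrative examples only; the real script's detection set is broader and, as noted above, never prints matches:

```python
import re

# Illustrative secret-like patterns (the real script covers more formats).
SECRET_PATTERNS = [
    re.compile(r"sk-[A-Za-z0-9]{20,}"),   # OpenAI-style API keys
    re.compile(r"ghp_[A-Za-z0-9]{36}"),   # GitHub personal access tokens
    re.compile(r"AKIA[0-9A-Z]{16}"),      # AWS access key IDs
]

def redact(text, placeholder="[REDACTED_SECRET]"):
    """Replace secret-like values with a placeholder, keeping the
    surrounding context ("where it lives, what it's for") intact."""
    for pat in SECRET_PATTERNS:
        text = pat.sub(placeholder, text)
    return text
```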

## FAQ

**Can I use NoldoMem without OpenClaw?** Yes. NoldoMem is a standalone REST API. Any application that can make HTTP requests can store and recall memories. The hooks are OpenClaw-specific, but the API works with anything.

**How much disk space do memories use?** Roughly 1-2 KB per memory (text + metadata + embedding vector); 10,000 memories take about 15-20 MB. SQLite with WAL mode handles concurrent access well.

**What happens if the embedding server goes down?** NoldoMem continues to work in degraded mode — keyword search (BM25) still works, but semantic search returns no results. The /v1/health endpoint reports "embedding": false. Memories stored without embeddings get auto-embedded when the server comes back (via the backfill worker).

**Can multiple agents share memories?** Each agent has its own isolated SQLite database by design. Use ?agent=<id> to route requests. Shared reads are explicit opt-in: agent=all enables cross-agent recall/search-style operations, while writes still stay scoped to one agent or namespace for safety.

**How do I force a memory type?** Pass "memory_type": "rule" (or fact/preference/lesson/conversation) in your /v1/store request to override auto-classification.
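For instance, a /v1/store request body that forces the type. The namespace value is just an example; memory_type and namespace are the fields documented above:

```python
import json

# JSON body for POST /v1/store; "memory_type" overrides
# auto-classification, "namespace" optionally scopes the memory.
body = json.dumps({
    "text": "Always run tests before commit",
    "agent": "main",
    "memory_type": "rule",
    "namespace": "project-x",
})
```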

## Troubleshooting

| Problem | Cause & Fix |
|---|---|
| curl returns "Unauthorized" | Wrong API key. Check that ~/.noldomem/memory-api-key matches AGENT_MEMORY_API_KEY in .env |
| /v1/health shows "embedding": false | Embedding server not running or wrong URL. Check OPENROUTER_BASE_URL in .env |
| Agent doesn't remember anything | Hooks not loading. Verify hooks.internal.enabled: true in openclaw.json and restart OpenClaw |
| vector dimension mismatch error | Changed embedding model without reindexing. Run .venv/bin/python scripts/reindex_embeddings.py |
| Port 8787 already in use | Set AGENT_MEMORY_PORT=8788 (or any free port) in .env |
| Memories disappearing too fast | Decay is too aggressive. Reduce decay cron frequency, or pin critical memories via /v1/pin |
| sqlite3.OperationalError: database is locked | Concurrent writes. NoldoMem handles this with WAL mode, but check for external tools accessing the DB |

## Upgrading

```shell
cd /path/to/noldo-memory
git pull origin main
.venv/bin/pip install -r requirements.txt
# Compare .env.example with your .env for new variables
# Database migrations run automatically on startup
sudo systemctl restart noldo-memory
```

## Contributing

See CONTRIBUTING.md.

## License

MIT