agent-memory

Persistent memory for AI coding agents. Single binary. Zero setup. Instant search.

AI agents forget everything between sessions. You repeat context, re-explain decisions, re-describe architecture. agent-memory fixes this -- it gives your agent a persistent, searchable memory backed by a single SQLite file.

go install github.com/dklymentiev/agent-memory@latest

The Problem

Every time you start a new Claude Code session, your agent starts from scratch:

"We decided to use PostgreSQL for this" -- you've said it 5 times
"Don't deploy on Fridays" -- the agent doesn't know
"The auth flow uses JWT refresh tokens" -- explained it last week, gone today

The Solution

# Save a decision once
agent-memory add "Auth uses JWT refresh tokens, 15min access / 7d refresh" \
  -t type:decision -t topic:auth

# Agent finds it when relevant
agent-memory search "authentication tokens"

# Or get smart context automatically at session start
agent-memory context

One binary. One SQLite file. No Docker. No external database. No configuration required.

Key Features

Search that works -- FTS5 full-text search with BM25 ranking. Optional hybrid search (30% keyword + 70% semantic) with local ONNX embeddings or OpenAI embeddings finds what you need even when the wording differs. Local embeddings need no API key and, after the one-time model download, no internet access.

Workspaces -- isolate memories per project. Run agent-memory focus backend-api and that project's memories stay separate from every other workspace.

MCP server -- 14 tools for Claude Code, Cursor, or any MCP-compatible agent. One command to set up: claude mcp add agent-memory -- agent-memory mcp

Auto-capture hooks -- automatically saves tool outputs and user prompts from Claude Code sessions. Sensitive data (passwords, API keys, tokens, JWTs) is scrubbed before storage.

Smart context -- progressive disclosure system assembles the right context for each session: pinned memories first, then recent, then search results -- all within a token budget.
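The tiered assembly described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the `memory` type, the `assembleContext` function, the precomputed `Tokens` estimate, and the skip-if-too-large policy are all assumptions made for the example.

```go
package main

import "fmt"

// memory is a hypothetical record; Tokens is an assumed, precomputed size estimate.
type memory struct {
	ID     string
	Tokens int
}

// assembleContext walks the tiers in priority order (pinned, recent, search
// hits) and keeps every item that still fits in the remaining token budget.
// An item that appears in more than one tier is included only once.
func assembleContext(budget int, tiers ...[]memory) []memory {
	var out []memory
	seen := make(map[string]bool)
	remaining := budget
	for _, tier := range tiers {
		for _, m := range tier {
			if seen[m.ID] || m.Tokens > remaining {
				continue // duplicate, or too large for what is left
			}
			seen[m.ID] = true
			remaining -= m.Tokens
			out = append(out, m)
		}
	}
	return out
}

func main() {
	pinned := []memory{{ID: "architecture", Tokens: 400}}
	recent := []memory{{ID: "deploy-rules", Tokens: 300}, {ID: "architecture", Tokens: 400}}
	hits := []memory{{ID: "auth-flow", Tokens: 500}, {ID: "style", Tokens: 200}}

	for _, m := range assembleContext(1000, pinned, recent, hits) {
		fmt.Println(m.ID)
	}
	// Prints: architecture, deploy-rules, style (auth-flow no longer fits).
}
```

Skipping an oversized item instead of stopping at it lets smaller, lower-priority memories use the leftover budget; a real implementation might truncate or summarize instead.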

Also: auto-tagging, markdown-aware chunking, content dedup (SHA-256), prompt templates, timeline view, JSON/Markdown export, prompt injection protection.

Quick Start

# Add memories
agent-memory add "Use snake_case for Python, camelCase for TypeScript" \
  -t type:decision -t topic:style

# Search
agent-memory search "naming conventions"

# Switch workspace
agent-memory focus my-project

# Get session context (pinned + recent + relevant)
agent-memory context

# Pipe files into memory
cat ARCHITECTURE.md | agent-memory add -f - -t type:artifact --pin

Semantic Search (Embeddings)

By default, agent-memory uses FTS5 keyword search. Enable embeddings for hybrid search (30% keyword + 70% semantic) that understands meaning, not just exact words.
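To make the 30/70 weighting concrete, here is a minimal sketch of how a keyword score and a semantic score could be blended. Only the two weights come from this README; the `scored` type, the function names, and the assumption that both scores are already normalized to the range 0 to 1 are hypothetical.

```go
package main

import (
	"fmt"
	"sort"
)

// Weights as stated in the README: 30% keyword, 70% semantic.
const (
	keywordWeight  = 0.3
	semanticWeight = 0.7
)

// scored is a hypothetical search hit. Both scores are assumed to be
// normalized to [0, 1] before blending.
type scored struct {
	ID       string
	Keyword  float64 // e.g. a normalized BM25 relevance
	Semantic float64 // e.g. an embedding cosine similarity
}

// hybridScore blends the two signals with the 30/70 split.
func hybridScore(s scored) float64 {
	return keywordWeight*s.Keyword + semanticWeight*s.Semantic
}

// rank returns a copy of results sorted by descending hybrid score.
func rank(results []scored) []scored {
	out := append([]scored(nil), results...)
	sort.SliceStable(out, func(i, j int) bool {
		return hybridScore(out[i]) > hybridScore(out[j])
	})
	return out
}

func main() {
	results := []scored{
		{ID: "exact-wording", Keyword: 0.9, Semantic: 0.2},
		{ID: "same-meaning", Keyword: 0.1, Semantic: 0.8},
	}
	for _, r := range rank(results) {
		fmt.Printf("%s %.2f\n", r.ID, hybridScore(r))
	}
}
```

With these sample values the memory that matches in meaning outranks the one that only shares exact words, which is the behavior hybrid mode is meant to provide.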

Local embeddings (recommended) -- runs entirely on your machine, no API keys, no internet after setup:

agent-memory embeddings enable --local
# Downloads ONNX Runtime (~8MB) and all-MiniLM-L6-v2 model (~87MB)
# Everything stored in ~/.agent-memory/

OpenAI embeddings -- higher quality, requires API key:

export OPENAI_API_KEY=sk-...
agent-memory embeddings enable --openai

Once enabled, search automatically uses hybrid mode:

agent-memory search "authentication flow"     # hybrid: FTS + semantic
agent-memory search "auth" --fts              # force keyword-only
agent-memory search "how do users log in" --semantic  # force semantic-only

| Provider | Model | Dimensions | Speed | Cost |
| --- | --- | --- | --- | --- |
| local | all-MiniLM-L6-v2 | 384 | ~0.4s/query | Free |
| openai | text-embedding-3-small | 1536 | ~0.3s/query | ~$0.02/1M tokens |

MCP Server (Claude Code Integration)

# One-liner setup
claude mcp add agent-memory -- agent-memory mcp

Or add to .mcp.json in your project:

{
  "mcpServers": {
    "agent-memory": {
      "command": "agent-memory",
      "args": ["mcp"]
    }
  }
}

14 MCP tools available: memory_add, memory_search, memory_context, memory_list, memory_focus, memory_delete, memory_update, memory_stats, memory_timeline, memory_save_prompt, memory_get_prompt, memory_suggest_tags, memory_session_start, memory_session_end.

See docs/guide.md for details on each tool.

Claude Code Hooks

Auto-capture context from coding sessions:

{
  "hooks": [
    {"event": "PostToolUse", "command": "agent-memory hook post-tool-use", "timeout": 5000},
    {"event": "SessionStart", "command": "agent-memory hook session-start", "timeout": 5000},
    {"event": "UserPromptSubmit", "command": "agent-memory hook user-prompt-submit", "timeout": 5000},
    {"event": "SessionEnd", "command": "agent-memory hook session-end", "timeout": 5000}
  ]
}

Hooks scrub sensitive data (passwords, API keys, tokens, JWTs, private keys) and protect against prompt injection.
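As an illustration of regex-based scrubbing, the sketch below redacts two kinds of secrets before text would be stored. The two patterns and the `rule` and `scrub` names are hypothetical examples and are not the rule set agent-memory ships; see SECURITY.md for the real protections.

```go
package main

import (
	"fmt"
	"regexp"
)

// rule pairs a pattern with its replacement text.
type rule struct {
	re   *regexp.Regexp
	repl string
}

// Illustrative rules only; the real tool's patterns may differ.
var rules = []rule{
	// JWT-like value: three base64url segments, the first starting with "eyJ".
	{regexp.MustCompile(`eyJ[A-Za-z0-9_-]+\.[A-Za-z0-9_-]+\.[A-Za-z0-9_-]+`), "[REDACTED_JWT]"},
	// key=value or key: value secrets; the key name and separator are kept.
	{regexp.MustCompile(`(?i)\b(password|api[_-]?key|token)(\s*[=:]\s*)\S+`), "${1}${2}[REDACTED]"},
}

// scrub applies every rule in order and returns the redacted text.
func scrub(s string) string {
	for _, r := range rules {
		s = r.re.ReplaceAllString(s, r.repl)
	}
	return s
}

func main() {
	fmt.Println(scrub("password=hunter2 ok"))
	fmt.Println(scrub("Authorization: Bearer eyJhbGciOi.eyJzdWIi.sig123"))
}
```

Pattern-based scrubbing only catches secrets that look like the patterns, so it is best treated as a safety net and not as a guarantee.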

How It Compares

| | agent-memory | mem0 | Zep | ChromaDB |
| --- | --- | --- | --- | --- |
| Setup | `go install`, done | Python + API key | Docker + Postgres | Python + server |
| Dependencies | None (single binary) | Python, OpenAI | Docker, Postgres, Redis | Python, multiple |
| Storage | Single SQLite file | Cloud or self-hosted | Postgres | Persistent dir |
| MCP support | Built-in (14 tools) | No | No | No |
| Search | FTS5 + optional embeddings | Embeddings only | Embeddings + graph | Embeddings only |
| Works offline | Yes (FTS5 mode) | No | Yes | Yes |
| Binary size | ~11MB | N/A | N/A | N/A |

agent-memory is designed for personal/small-team use with AI coding agents. If you need a production vector database for millions of documents, use ChromaDB or Pinecone. If you want something that works in 10 seconds with zero infrastructure, this is it.

Security

Database and config files: restricted permissions (0600/0700)
Hook data: sensitive patterns scrubbed before storage (12 regex rules)
Session context: prompt injection protection (XML boundaries + 17-pattern deny-list)
Search queries: FTS5 and LIKE injection sanitized
Content: 1MB limit, SHA-256 dedup, workspace name validation
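The content rules in the last item can be sketched as below: reject oversized input, then use the SHA-256 digest as a dedup key. This is a hedged illustration with an in-memory map standing in for the SQLite table; the `store` type, the `add` method, and the exact byte value chosen for "1MB" are assumptions.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"errors"
	"fmt"
)

// maxContentBytes is assumed to be 1 MiB; the README only says "1MB".
const maxContentBytes = 1 << 20

var errTooLarge = errors.New("content exceeds size limit")

// store is an in-memory stand-in for the database, keyed by content hash.
type store struct {
	byHash map[string]string
}

func newStore() *store { return &store{byHash: make(map[string]string)} }

// add enforces the size limit, hashes the content with SHA-256, and stores it
// only if that hash has not been seen. It reports whether a new entry was created.
func (s *store) add(content string) (hash string, added bool, err error) {
	if len(content) > maxContentBytes {
		return "", false, errTooLarge
	}
	sum := sha256.Sum256([]byte(content))
	hash = hex.EncodeToString(sum[:])
	if _, dup := s.byHash[hash]; dup {
		return hash, false, nil
	}
	s.byHash[hash] = content
	return hash, true, nil
}

func main() {
	s := newStore()
	h1, added1, _ := s.add("Don't deploy on Fridays")
	h2, added2, _ := s.add("Don't deploy on Fridays")
	fmt.Println(h1 == h2, added1, added2) // true true false
}
```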

See SECURITY.md for full details.

Storage

All data in one file:

~/.agent-memory/memory.db          # default (global)
.agent-memory/memory.db            # per-project (via agent-memory init)
agent-memory --db /path/to/my.db   # custom path

Documentation

User Guide -- installation, all features, configuration
Technical Reference -- all types, functions, schemas, CLI flags
Architecture -- internal design, data flow, package structure
Contributing -- development setup, code style, PR guidelines
Security -- threat model, protections, vulnerability reporting

Planned

Data retention TTL with automatic cleanup
Web UI for browsing and searching memories

Building from Source

git clone https://github.com/dklymentiev/agent-memory.git
cd agent-memory
make build      # builds ./agent-memory
make test       # runs tests
make install    # copies to /usr/local/bin

Requires Go 1.21 or later. If your installed version is older than the one the module requires, the go command automatically downloads the Go 1.25 toolchain.

License

MIT
