@cartisien/extensa

Vector infrastructure and Matryoshka embeddings for AI agents.

Part of the Cartisien Memory Suite — res extensa to Cogito's res cogitans and Engram's trace.

What is Extensa?

Extensa is the vector infrastructure layer for the Cartisien Memory Suite. While Engram handles memory storage and retrieval, Extensa manages the raw embedding pipeline:

Matryoshka embeddings — adaptive dimensionality (768 → 512 → 256 → 128) from a single model pass
Model management — Ollama integration, fallback chains, batch embedding
Similarity primitives — cosine, dot product, L2 distance
Caching — LRU cache for repeated embedding calls
Normalization — L2 normalization, whitening

import { Extensa } from '@cartisien/extensa';

const vec = new Extensa({ ollamaUrl: 'http://localhost:11434', model: 'nomic-embed-text' });

// Embed a single string
const embedding = await vec.embed('User prefers TypeScript');
// → number[] (768 dims by default)

// Matryoshka — get multiple sizes in one pass
const { full, half, quarter } = await vec.embedMatryoshka('User prefers TypeScript');
// → { full: number[768], half: number[384], quarter: number[192] }

// Batch
const embeddings = await vec.embedBatch(['text one', 'text two', 'text three']);

// Similarity
const otherEmbedding = await vec.embed('User likes JavaScript');
const score = vec.cosine(embedding, otherEmbedding);

Why Matryoshka?

Standard embeddings are fixed-size. Matryoshka Representation Learning (MRL) trains models so that the first N dimensions of a large embedding are themselves a meaningful smaller embedding. This means:

Store full 768-dim vectors for high-recall search
Use truncated 128-dim prefixes (the first 128 values of each vector) for fast candidate filtering
Adaptive precision — trade accuracy for speed at query time

Extensa handles the dimension slicing, normalization, and storage recommendations automatically.
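The coarse-to-fine pattern described above can be sketched as follows. This is a minimal illustration, not Extensa's implementation: `StoredDoc`, `prefix`, `topK`, and `twoStageSearch` are hypothetical names, and a real system would precompute and store the short prefixes rather than slicing on every query.

```typescript
// Illustrative coarse-to-fine search over Matryoshka embeddings.
// All names here are hypothetical and not part of Extensa's API.

interface StoredDoc {
  id: string;
  vector: number[]; // full-size embedding
}

/** Keep the first `dims` values and re-normalize them to unit length. */
function prefix(vector: number[], dims: number): number[] {
  const head = vector.slice(0, dims);
  const norm = Math.sqrt(head.reduce((s, x) => s + x * x, 0));
  return norm === 0 ? head : head.map((x) => x / norm);
}

function cosineSim(a: number[], b: number[]): number {
  let dotAB = 0, aa = 0, bb = 0;
  for (let i = 0; i < a.length; i++) {
    dotAB += a[i] * b[i];
    aa += a[i] * a[i];
    bb += b[i] * b[i];
  }
  const denom = Math.sqrt(aa * bb);
  return denom === 0 ? 0 : dotAB / denom;
}

/** Rank `docs` by similarity to `query` on the first `dims` values; keep the best `k`. */
function topK(query: number[], docs: StoredDoc[], k: number, dims: number): StoredDoc[] {
  const q = prefix(query, dims);
  return docs
    .map((doc) => ({ doc, score: cosineSim(q, prefix(doc.vector, dims)) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.doc);
}

/** Cheap pass on short prefixes, then a precise pass on the survivors. */
function twoStageSearch(
  query: number[],
  docs: StoredDoc[],
  coarseDims: number,
  shortlist: number,
  k: number,
): StoredDoc[] {
  const candidates = topK(query, docs, shortlist, coarseDims);
  return topK(query, candidates, k, query.length);
}
```

The shortlist size controls the trade-off: a larger shortlist recovers more of the full-dimension ranking at the cost of more full-size comparisons.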

API (v0.2 — coming soon)

`new Extensa(config)`

const vec = new Extensa({
  ollamaUrl: 'http://localhost:11434',  // local Ollama
  model: 'nomic-embed-text',            // embedding model
  dimensions: 768,                       // output dimensions
  cache: true,                           // LRU cache (default: true)
  cacheSize: 1000,                       // max cached embeddings
});
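To illustrate what the `cache` and `cacheSize` options imply, here is a minimal LRU cache sketch keyed by input text. It is an assumption about how such a cache could work, not Extensa's code; `EmbeddingCache` is a hypothetical name.

```typescript
// Minimal LRU cache sketch keyed by input text. Hypothetical, not Extensa's code.
// Relies on Map preserving insertion order: the first key is the least recently used.

class EmbeddingCache {
  private readonly entries = new Map<string, number[]>();
  private readonly maxSize: number;

  constructor(maxSize: number) {
    this.maxSize = maxSize;
  }

  get(text: string): number[] | undefined {
    const hit = this.entries.get(text);
    if (hit !== undefined) {
      // Re-insert to mark this entry as most recently used.
      this.entries.delete(text);
      this.entries.set(text, hit);
    }
    return hit;
  }

  set(text: string, embedding: number[]): void {
    this.entries.delete(text);
    this.entries.set(text, embedding);
    if (this.entries.size > this.maxSize) {
      // Evict the oldest entry (first key in insertion order).
      const oldest = this.entries.keys().next().value;
      if (oldest !== undefined) this.entries.delete(oldest);
    }
  }

  get size(): number {
    return this.entries.size;
  }
}
```

With `cacheSize: 1000`, a cache like this would hold at most 1000 distinct input strings and drop the least recently used one when a new string arrives.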

`embed(text): Promise<number[]>`

Embed a single string. Returns an L2-normalized vector.

`embedBatch(texts): Promise<number[][]>`

Embed multiple strings in parallel with rate limiting.
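One simple way to get parallelism without flooding the embedding server is a concurrency cap. The sketch below assumes that approach; the README does not say which rate-limiting strategy Extensa uses. `embedBatchLimited` and `EmbedFn` are hypothetical names, and `embedOne` stands in for any single-text embed call.

```typescript
// Hypothetical sketch of batch embedding with a concurrency cap.
// `embedOne` stands in for whatever single-text embed call is available.

type EmbedFn = (text: string) => Promise<number[]>;

async function embedBatchLimited(
  texts: string[],
  embedOne: EmbedFn,
  concurrency = 4,
): Promise<number[][]> {
  const results: number[][] = new Array(texts.length);
  let next = 0;

  // Each worker claims the next unprocessed index until none remain.
  async function worker(): Promise<void> {
    while (next < texts.length) {
      const i = next++;
      results[i] = await embedOne(texts[i]);
    }
  }

  const workerCount = Math.max(1, Math.min(concurrency, texts.length));
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results; // same order as `texts`
}
```

Results are written by index, so the output order matches the input order even when requests finish out of order.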

`embedMatryoshka(text): Promise<MatryoshkaResult>`

Return multiple dimension slices from a single embedding pass.
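As a rough sketch of the slicing step, the function below cuts one full embedding into nested prefixes and re-normalizes each one. The `full`, `half`, and `quarter` field names follow the quick-start example; `MatryoshkaSlices`, `l2Normalize`, and `sliceMatryoshka` are hypothetical names, and the real internals may differ.

```typescript
// Hypothetical sketch: derive nested Matryoshka slices from one full embedding.
// Field names mirror the quick-start example; the implementation is assumed.

interface MatryoshkaSlices {
  full: number[];
  half: number[];
  quarter: number[];
}

/** Scale a vector to unit L2 norm (zero vectors are returned unchanged). */
function l2Normalize(v: number[]): number[] {
  const norm = Math.sqrt(v.reduce((sum, x) => sum + x * x, 0));
  return norm === 0 ? v.slice() : v.map((x) => x / norm);
}

/** Take the leading dimensions, then re-normalize each slice independently. */
function sliceMatryoshka(embedding: number[]): MatryoshkaSlices {
  const n = embedding.length;
  return {
    full: l2Normalize(embedding),
    half: l2Normalize(embedding.slice(0, Math.floor(n / 2))),
    quarter: l2Normalize(embedding.slice(0, Math.floor(n / 4))),
  };
}
```

Re-normalizing after truncation matters because a prefix of a unit vector is generally shorter than unit length. The slices are only meaningful when the underlying model was trained with Matryoshka Representation Learning.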

Similarity

vec.cosine(a, b)    // cosine similarity → [-1, 1]
vec.dot(a, b)       // dot product
vec.l2(a, b)        // L2 distance
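For reference, the three measures can be written as stand-alone functions. This is a sketch of the standard definitions, not Extensa's source; the zero-vector handling in `cosine` is an assumption.

```typescript
// Stand-alone versions of the three similarity measures.
// Illustrative only; not Extensa's actual implementation.

function assertSameLength(a: number[], b: number[]): void {
  if (a.length !== b.length) {
    throw new Error(`Dimension mismatch: ${a.length} vs ${b.length}`);
  }
}

/** Dot product: sum of element-wise products. */
function dot(a: number[], b: number[]): number {
  assertSameLength(a, b);
  let sum = 0;
  for (let i = 0; i < a.length; i++) sum += a[i] * b[i];
  return sum;
}

/** Cosine similarity in [-1, 1]; returns 0 if either vector has zero length (assumed convention). */
function cosine(a: number[], b: number[]): number {
  const denom = Math.sqrt(dot(a, a)) * Math.sqrt(dot(b, b));
  return denom === 0 ? 0 : dot(a, b) / denom;
}

/** L2 (Euclidean) distance between two vectors. */
function l2(a: number[], b: number[]): number {
  assertSameLength(a, b);
  let sum = 0;
  for (let i = 0; i < a.length; i++) {
    const d = a[i] - b[i];
    sum += d * d;
  }
  return Math.sqrt(sum);
}
```

For L2-normalized vectors, cosine similarity and dot product give the same value, so the cheaper dot product is enough for ranking.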

QA / Smoke test (Alienware RTX 5090)

Time: Thursday, March 12th, 2026 — 4:02 PM (America/New_York)
Smoke test: single embed against Ollama @ 192.168.68.73:11434
Result: dims=768, sample first 5 values = [-0.28037112951278687,-1.0087080001831055,-4.1742777824401855,-0.15757766366004944,0.9060647487640381]

Extensa compiled and returned valid embeddings in a smoke run. Next: run batch benchmarks and add CI tests.

The Cartisien Memory Suite

| Package | Role | Status |
| --- | --- | --- |
| `@cartisien/engram` | Persistent memory | ✅ Stable |
| `@cartisien/cogito` | Agent lifecycle & identity | 🔧 In development |
| `@cartisien/extensa` | Vector infrastructure | 🔧 In development |

"Res cogitans meets res extensa."

Research

This package is part of the Cartisien Memory Suite described in:

Cartisien. (2026). Engram: A Local-First Persistent Memory Architecture for Conversational AI Agents. Zenodo. https://doi.org/10.5281/zenodo.18988892
