Local-first vector memory with a self-organising 4-layer graph. Spec-decoding retrieval. Zettelkasten-linked edges.
Conversation turns, tool outputs, observations. Any unstructured agent context fed in as text.
Every memory is evaluated: ADD new info, UPDATE existing, DELETE contradictions, or NO_OP if already known. Zero duplicates.
Persisted across 4 graph types in SQLite. Survives all session resets. REM cycle compresses while idle.
Similarity between memories. Finds related concepts across your full context history.
Cause → Effect relationships. Understands why things happened, not just what.
Before → After sequences. Tracks how knowledge evolves and decays over time.
Named entity co-occurrence. Connects people, projects, and events automatically.
Most vector stores are passive. They store what you put in and return what you ask for. VEKTOR is an active memory layer — it evolves, curates, and reasons about what your agent should remember.
Standard RAG vector stores
VEKTOR Memory
Drop-in memory layer for LangChain agents.
recall() returns context, remember() stores.
v1 + v2 adapters included.
Persistent memory for OpenAI agent loops.
Recalled context injected into system prompt.
GPT-4o and o-series models supported.
Full MCP module — vektor_recall, vektor_store,
vektor_graph, vektor_delta tools.
Connect Claude Desktop in minutes.
Provider-agnostic single config switch.
Key pooling for Gemini — up to 9 API keys,
waterfall rotation, zero rate-limit downtime.
vektor_memoire HTTP tool for Le Chat
and Mistral API agents. Local bridge on
localhost:3847. French-first sovereign memory.
53-tool MCP layer for Claude Desktop.
Stealth browser, credential vault, CAPTCHA solving,
behaviour injection. Zero cloud. One install.
Your agent now knows why memories are connected — not just what. Four-phase causal engine: G-Formula, MSM/IPW, IV Bounds, and Root Cause Analysis. Traces agent failures backwards through the causal chain, scores root causes by impact, and predicts the fix. No other memory layer does this.
G-Formula · MSM/IPW · IV Bounds · RCADeep research that never goes off-script. 8-step deterministic pipeline replaces the old unbounded loop: DECOMPOSE → VAULT-FIRST → SWEEP → LOCI → COMMIT → ADVERSARIAL → SYNTHESISE → CRITIC+PATCH. Every run is auditable, reproducible, and hallucination-resistant.
deep:true · adversarial_search · loci_rank · patchTwo-pass whitepaper generation via Groq LLaMA with APA7 citation infrastructure. Your notes surface relevant memories as you write. Ghost-text autocomplete, briefing scheduler, post-generation citation scanner. Long-form thinking that lives alongside your agent — all local, all yours.
Notes RAG · Two-pass · APA7 · BriefingIntegrated notes layer with TAG pill, notes RAG, and two-pass article generation via Groq LLaMA. APA7 citation infrastructure, post-generation citation scanner, ghost-text autocomplete, and briefing scheduler. Notes live alongside memories in local SQLite — never leaves your machine.
Notes · RAG · Synthesis · Citations · Briefing
// 1. Install
// npm install vektor-slipstream
import { createMemory } from 'vektor-slipstream';
// 2. Initialise
const memory = await createMemory({
provider: 'gemini',
apiKey: process.env.GEMINI_API_KEY,
agentId: 'my-agent',
dbPath: './my-agent.db',
});
// 3. Remember — AUDN decides ADD/UPDATE/DELETE
await memory.remember("User prefers TypeScript");
// 4. Recall
const ctx = await memory.recall("coding preferences");
// 5. Traverse the graph
const g = await memory.graph("TypeScript", { hops: 2 });
// 6. What changed in 7 days?
const d = await memory.delta("architecture", 7);
Pure SQLite. No cloud dependency, no API keys for memory. Your memory graph never leaves your server. LLM providers process queries per their own privacy policies.
Claude, Gemini, Groq, Mistral, OpenAI, Ollama, OpenRouter. Switch provider with one config change. Key pooling for Gemini — waterfall rotation across up to 9 keys.
Automatic curation loop prevents contradictions and duplicates. The graph stays consistent without any manual management.
Background process compresses 50 fragments into 3 core insights. Runs while your agent is idle. Run via vektor rem from the CLI.
The graph type system underpinning VEKTOR's four memory layers — semantic, causal, temporal, and entity.
GRAPH ARCHITECTURE ARXIV // 2601.02163Self-organizing memory architecture for structured long-horizon reasoning. Informs VEKTOR's REM cycle approach to memory consolidation and lifecycle management across extended agent sessions.
MEMORY LIFECYCLE ARXIV // 2504.19413Scalable long-term memory via dynamic extraction, consolidation, and graph-based retrieval. Informs VEKTOR's AUDN curation loop and REM compression cycle. Mem0 benchmarks show 90%+ token cost reduction — consistent with VEKTOR's synthesis approach.
MEMORY COMPRESSION LETTA.COMThe memory-as-OS paradigm that inspired VEKTOR's approach to agent context management and recall.
AGENT OSCloud memory APIs charge twice: a subscription for the service, and an embedding API fee on every single store and recall operation. Those embedding calls add up fast — at production agent volume they often exceed the subscription itself. VEKTOR runs on your machine and routes through the LLM provider you already pay for. No second bill. No hidden meter.
Cross-standard vector DB migration. Export, import, and migrate agent memory between any vector store using the open .vmig.jsonl interchange format. One file. Any store. No lock-in.
npx vex migrate --from vektor --to qdrant
| STORE | EXPORT | IMPORT |
|---|---|---|
| vektor | ✓ | ✓ |
| jsonl | ✓ | ✓ |
| pinecone | ✓ | ✓ |
| qdrant | ✓ | ✓ |
| chroma | ✓ | ✓ |
| weaviate | ✓ | ✓ |
| pgvector | ✓ | ✓ |
| redis | ✓ | ✓ |
| milvus | ✓ | ✓ |
| neo4j | ✓ | ✓ |
| claude-export | ✓ | — |
| chatgpt-export | ✓ | — |
Translate vectors between embedding model spaces using pre-trained linear projection weights — no API calls, no re-embedding, pure matrix multiply. Switch models without losing your memory.
npm install -g @vektormemory/vex-adapter
Keep your MCP server configurations in sync across every AI editor you use. One source of truth for all your mcp.json configs. Edit once, sync everywhere. No drift, no duplication.
npm install -g @vektormemory/vek-sync
Your MCP credentials — API keys, tokens, secrets — are encrypted at rest using AES-256-GCM with OS-bound key derivation. No plaintext config files. No secrets in git. Credentials travel with the sync, not around it.
| EDITOR | CONFIG PATH | SYNC |
|---|---|---|
| Claude Desktop | Claude Desktop app | ✓ |
| Cursor | Cursor editor | ✓ |
| VS Code | .vscode/mcp.json | ✓ |
| Windsurf | Windsurf by Codeium | ✓ |
| Claude Code | Claude Code CLI | ✓ |
| Cline | saoudrizwan.claude-dev | ✓ |
| Roo Code | rooveterinaryinc.roo-cline | ✓ |
| Gemini | Gemini CLI | ✓ |
| Copilot | GitHub Copilot CLI | ✓ |
| Continue | continue.continue | ✓ |
| Codex | Codex CLI — TOML | ✓ |
How Vek-Sync eliminates config drift across every AI editor on your machine.
Route context, tasks, and memory across every AI tool you use. Connect Claude, Cursor, Windsurf, ChatGPT, and LangChain to a shared bus — so your work follows you across every tool, every session, every machine.
npm install -g @vektormemory/via
Via scans your project and builds a token-aware file anatomy index. Every connected agent gets instant context — no manual briefing, no re-explaining. Drop it into any project and every tool knows where everything is.
| TOOL | DESCRIPTION |
|---|---|
| Claude | Shared context + memory bus |
| Cursor | Codebase graph + task routing |
| Windsurf | Session context sync |
| ChatGPT | Cross-tool memory handoff |
| LangChain | Agent context injection |
| File watcher | Auto-index on change |
| Scaffold | Project structure templates |
| File convert | Format conversion MCP tool |
Route context and tasks across Claude, Cursor, Windsurf, ChatGPT, and LangChain from one shared bus.
No cloud. No embedding bill. No data handshake.
VEKTOR runs on your machine, under your control, permanently.
Your memory graph is a portable SQLite file — no lock-in, ever.