VEKTOR Notes is a local-first AI note-taking app for Android. Everything — your notes, memories, and the AI models processing them — runs on your device. No cloud uploads. No tracking.
The interface has two modes you swipe between: JOT for capture and CHAT for retrieval. Under the hood, a four-layer memory graph (MAGMA) builds a persistent, searchable representation of everything you write.
Install from Google Play and open the app. You land directly on the JOT surface — a blank text area. No account required. No onboarding wizard. The app is immediately functional for capture and local storage without any configuration.
To enable AI features (ghost suggestions and CHAT responses), you need to add an API key from a supported LLM provider.
VEKTOR Notes uses your own provider keys. Your key is stored in SecureStore on-device — it never touches VEKTOR servers. API calls go directly from your device to the provider.
Your API key is stored in Android's encrypted SecureStore. It is never sent to VEKTOR servers. You are not paying VEKTOR for LLM usage — costs go directly to your provider.
| Provider | Best for | Cost guide |
|---|---|---|
| Anthropic Claude Haiku | CHAT — best memory synthesis quality | ~$0.003/1K tokens |
| OpenAI GPT-4o-mini | Balanced JOT + CHAT | ~$0.0002/1K tokens |
| Groq LLaMA | Fastest JOT ghost suggestions | Free tier available |
| Gemini Flash | Long context CHAT | Free tier available |
JOT is the primary writing surface. A clean text area with no menus, no formatting toolbar, no friction. You type. The app watches quietly. After 900 milliseconds of silence — long enough to fire only when you have genuinely paused, short enough that the suggestion arrives before you lose the thread — the ghost suggestion engine offers a completion.
Ghost suggestions appear as faded text beneath your current content. Two actions only:
Suggestions are deliberately capped at 30 words or fewer. If you want to develop an idea further, that is what CHAT is for.
| Action | What it does |
|---|---|
| Synthesise | Extracts key ideas and surfaces one unexpected connection. Terse, no preamble. |
| Expand | Adds one concrete example and one implication. Stays in your voice. |
| Clean | Fixes grammar, tightens prose, keeps every idea. Returns only the cleaned text. |
| Connect | Names 2–3 concepts this note links to and explains why. |
Swipe left from JOT to enter CHAT mode. Ask questions. The app searches everything you have written using a dual-channel retrieval pipeline and answers from your own context — not the internet.
Swipe up from JOT to open the memory graph. Every saved memory is a node. Every inferred relationship is an edge. Nodes are coloured by layer type.
When you ask something in CHAT, two parallel retrieval paths run before the LLM sees your question:
1/(k+rank) across all lists, k=60. Documents appearing highly in multiple lists score best.MAGMA is the four-layer graph architecture that separates VEKTOR Notes from a notes app with a chat window bolted on. Each layer represents a different type of relationship, which determines how memories are retrieved.
The entire graph lives in a single SQLite file on your device. No cloud database. No GPU. Queries execute in milliseconds and back up with your device backup.
notes -- raw note content + timestamps
memories -- MAGMA nodes (content, layer, importance)
edges -- typed directional relationships
entities -- extracted named entities
FTS5 index -- BM25 full-text search
vec_memories -- float32 embeddings (sqlite-vec ANN)
The MCP bridge connects VEKTOR Notes to Claude Desktop, Cursor, or any MCP-compatible agent on your computer over your local Tailscale network. Your phone's memory becomes available to desktop AI tools.
tailscale ip -4VEKTOR_DB_PATH=/path/to/vektor.db node vektor-notes-mcp-bridge.jsRuns on port 3747 by default.100.x.x.x:3747claude_desktop_config.json.The model picker in Settings lets you route tasks to different models. JOT ghost suggestions can run on a faster, cheaper model than CHAT synthesis — you are not paying GPT-4 rates for a 30-word autocomplete.
Recommended: Groq LLaMA for JOT (fastest, free tier available), Claude Haiku for CHAT (best memory synthesis quality at low cost).
We do not own the models this app runs on, and we never will. You configure whichever provider you want, paste in your own API key, and the app uses it directly.