How do I install Vektor Memory?

Install Vektor Memory with a single command: npx mdskills install Vektor-Memory/vektor-memory. This downloads the skill files into your project and your AI agent picks them up automatically.

What platforms support Vektor Memory?

Vektor Memory works with Claude Code, Claude Desktop, Cursor, Vscode Copilot, Windsurf, Continue Dev, Gemini Cli, Amp, Roo Code, Goose. Skills use the open SKILL.md format which is compatible with any AI coding agent that reads markdown instructions.

← Back to MCP servers

Vektor Memory

Name: Vektor Memory: AI Agent Skill
Brand: Vektor-Memory
Availability: InStock
Rating: 3.6 (1 reviews)
Author: Vektor-Memory

Verified

MCP ServerProductivityIntermediate

Hardware-accelerated persistent memory for AI agents. Local-first. No cloud. One-time payment. 66.9% on LoCoMo benchmark (adjusted). Under 1ms retrieval. Zero cloud dependency. Retrieval pipeline rebuilt from scratch. - bge-small-en-v1.5 bi-encoder + ms-marco cross-encoder reranker (spec-decode architecture) - BM25 + Porter-stemmed BM25 + named entity injection, fused via RRF - MAGMA graph layer —

8.7advisor438popularityby getvektor

npx mdskills install Vektor-Memory/vektor-memory

Skill Advisor8.7

Hardware-accelerated local memory system with comprehensive MCP tools, strong benchmarks, and excellent documentation

+Provides 44 well-documented MCP tools covering memory, web scraping, SSH, CAPTCHA, and multimodal capabilities
+Delivers impressive sub-1ms retrieval with 66.9% LoCoMo benchmark score using local ONNX models
+Includes extensive setup instructions, CLI commands, integration guides for Claude Desktop/Code, and clear API examples
-Commercial license requirement and setup complexity may limit adoption and testing
-Declared permissions are maximally broad but legitimately needed for the comprehensive toolset

SKILL.md

1# vektor-slipstream
2
3Hardware-accelerated persistent memory for AI agents. Local-first. No cloud. One-time payment.
4
5[![npm](https://img.shields.io/npm/v/vektor-slipstream)](https://www.npmjs.com/package/vektor-slipstream)
6[![downloads](https://img.shields.io/npm/dw/vektor-slipstream)](https://www.npmjs.com/package/vektor-slipstream)
7[![license](https://img.shields.io/badge/license-Commercial-blue)](https://vektormemory.com/product#pricing)
8
9**66.9% on LoCoMo benchmark (adjusted). Under 1ms retrieval. Zero cloud dependency.**
10
11---
12
13## Install
14
15```bash
16npm install vektor-slipstream
17npx vektor setup
18```
19
20## Quick Start
21
22```js
23const { createMemory } = require('vektor-slipstream');
24
25const memory = await createMemory({
26  agentId:    'my-agent',
27  licenceKey: process.env.VEKTOR_LICENCE_KEY,
28});
29
30// Store a memory
31await memory.remember('User prefers TypeScript over JavaScript');
32
33// Recall by semantic similarity — sub-1ms, fully local
34const results = await memory.recall('coding preferences', 5);
35// → [{ content, score, id }]
36
37// Traverse the MAGMA graph
38const graph = await memory.graph('TypeScript', { hops: 2 });
39
40// What changed in 7 days?
41const delta = await memory.delta('project decisions', 7);
42
43// Morning briefing
44const brief = await memory.briefing();
45
46// Graph stats
47const stats = memory.graphStats();
48// → { nodes, edges, entities }
49```
50
51---
52
53## What's New in v1.5.0
54
55**Retrieval pipeline rebuilt from scratch.**
56
57- bge-small-en-v1.5 bi-encoder + ms-marco cross-encoder reranker (spec-decode architecture)
58- BM25 + Porter-stemmed BM25 + named entity injection, fused via RRF
59- MAGMA graph layer — co-occurrence and temporal edges between entities in SQLite
60- Persistent entity index (`vektor_entities`) for guaranteed named-entity recall
61- Foresight extraction — future-tense statements stored for temporal queries
62- Question type classifier — routes single-hop vs multi-hop to optimal retrieval path
63- ADD-only contradiction detection — conflicting facts survive with timestamps (no silent deletes)
64- Agentic sufficiency check — reformulates query if key entities missing from top results
65
66**LoCoMo benchmark results (conv 0, 154 valid questions):**
67
68| Category | Judge Accuracy |
69|---|---|
70| Multi-hop | 79.1% |
71| Adversarial | 70.4% |
72| Temporal | 46.2% |
73| Single-hop | 51.6% |
74| **Adjusted total** | **66.9%** |
75
76#Under 1ms retrieval latency with zero cloud API calls at query time.
77
78---
79
80## CLI Chat — Persistent Memory Terminal
81
82Chat with any LLM with full memory across every session. Zero configuration.
83
84```bash
85npx vektor chat                          # start chat (auto-detects Ollama)
86npx vektor chat --provider claude        # use Anthropic Claude
87npx vektor chat --provider groq --model llama-3.3-70b-versatile
88npx vektor chat --provider gemini
89npx vektor chat --provider openai
90```
91
92### Providers
93
94| Provider | Details |
95|---|---|
96| `ollama` | Default — free, local, no API key. Auto-detects best installed model. |
97| `claude` | Anthropic Claude — set `ANTHROPIC_API_KEY` |
98| `openai` | OpenAI GPT — set `OPENAI_API_KEY` |
99| `groq` | Groq LLaMA — set `GROQ_API_KEY` (free tier available) |
100| `gemini` | Google Gemini — set `GEMINI_API_KEY` |
101
102Set a permanent default:
103```bash
104# Windows
105$env:VEKTOR_PROVIDER = "claude"
106
107# macOS/Linux
108export VEKTOR_PROVIDER=claude
109```
110
111### In-chat commands
112
113Type `/` to see available commands with autocomplete. Tab to select, arrow keys to navigate.
114
115| Command | Action |
116|---|---|
117| `/recall <query>` | Search MAGMA memory mid-conversation |
118| `/stats` | Show memory node count, edges, pinned |
119| `/briefing` | Generate memory briefing inline |
120| `/exit` | Exit chat (Ctrl+C also works) |
121
122### One-liner commands
123
124```bash
125# Store a fact
126npx vektor remember "I prefer TypeScript over JavaScript"
127npx vektor remember "deadline is Friday" --importance 5
128
129# Pipe support
130cat meeting-notes.txt | npx vektor remember
131
132# One-shot recall + LLM answer
133npx vektor ask "what stack am I using?"
134npx vektor ask "what did we decide about the database?"
135
136# Autonomous goal executor
137npx vektor agent "summarise everything I know about project Alpha"
138npx vektor agent "research AI memory tools" --steps 15 --provider groq
139```
140
141### Ollama auto-detection
142
143VEKTOR queries `http://localhost:11434/api/tags` and picks the best available model:
144`qwen3` → `qwen2` → `llama` → `mistral` → first available.
145
146Override:
147```bash
148$env:OLLAMA_MODEL = "qwen3.5:4b"
149export OLLAMA_MODEL=qwen3.5:4b
150```
151
152---
153
154## All CLI Commands
155
156```bash
157npx vektor setup      # First-run wizard — licence, hardware, integrations
158npx vektor activate   # Activate licence key on this machine
159npx vektor test       # Test memory engine with progress bar
160npx vektor status     # System health check
161npx vektor mcp        # Start Claude Desktop MCP server
162npx vektor rem        # Run REM dream cycle
163npx vektor chat       # Persistent memory chat (all LLMs)
164npx vektor remember   # Store a fact
165npx vektor ask        # Query memory + LLM answer
166npx vektor agent      # Autonomous goal executor
167npx vektor help       # All commands
168```
169
170---
171
172## Claude Desktop Extension (DXT)
173
174Install the `.dxt` extension for zero-config memory in every Claude Desktop session.
175
176**Install:** drag `vektor-slipstream.dxt` onto the Claude Desktop Extensions page.
177
178Once installed, Claude automatically:
179- Recalls relevant context at session start
180- Stores facts and decisions during conversation
181- Summarises at session end
182
183All 44 tools are available in Claude Desktop — no configuration needed beyond your licence key.
184
185**User config fields:**
186
187| Field | Purpose |
188|---|---|
189| `licence_key` | Your Polar licence key (required) |
190| `db_path` | Memory DB path (defaults to `~/vektor-slipstream-memory.db`) |
191| `project_path` | Default path for `cloak_cortex` project scanning (optional) |
192
193Download the latest `.dxt` from [vektormemory.com/docs/dxt](https://vektormemory.com/docs/dxt).
194
195---
196
197## MCP Tools — All 44
198
199### Memory Tools
200
201| Tool | Function |
202|---|---|
203| `vektor_recall` | Semantic + BM25 + graph search across MAGMA memory |
204| `vektor_recall_rrf` | BM25+RRF dual-channel recall with cross-encoder rerank |
205| `vektor_store` | Store memory with importance score |
206| `vektor_ingest` | Batch ingest conversation turns with session date |
207| `vektor_graph` | Traverse associative memory graph |
208| `vektor_delta` | See what changed on a topic over time |
209| `vektor_briefing` | Generate morning briefing from recent memories |
210| `vektor_stats` | Memory DB stats — node count, edges, entities |
211| `vektor_graph_stats` | MAGMA graph node/edge/entity counts |
212| `vektor_timeline` | Query memories by date range |
213
214### CLOAK Core
215
216| Tool | Function |
217|---|---|
218| `cloak_fetch` | Stealth headless browser fetch via Playwright |
219| `cloak_fetch_smart` | Checks `llms.txt` first, falls back to stealth browser |
220| `cloak_render` | Full CSS/DOM layout sensor |
221| `cloak_diff` | Semantic diff of URL since last fetch |
222| `cloak_diff_text` | Structural diff between two text blobs |
223| `cloak_passport` | AES-256-GCM credential vault (get/set/delete/list) |
224| `cloak_ssh_exec` | Execute commands on remote server via SSH |
225| `cloak_ssh_upload` | Upload file to remote server via SFTP |
226| `tokens_saved` | Token efficiency ROI calculator |
227
228### Identity Tools
229
230| Tool | Function |
231|---|---|
232| `cloak_identity_create` | Create persistent browser fingerprint identity |
233| `cloak_identity_use` | Apply saved identity to a fetch call |
234| `cloak_identity_list` | List all saved identities with trust summary |
235
236### Behaviour Tools
237
238| Tool | Function |
239|---|---|
240| `cloak_inject_behaviour` | Human mouse/scroll injection for reCAPTCHA/Cloudflare bypass |
241| `cloak_behaviour_stats` | List available patterns and categories |
242| `cloak_load_pattern` | Load custom recorded behaviour pattern |
243| `cloak_pattern_stats` | Self-improving pattern store tier breakdown |
244| `cloak_pattern_list` | List patterns with scores and tier |
245| `cloak_pattern_prune` | Remove stale/low-scoring patterns |
246| `cloak_pattern_seed` | Seed store with built-in patterns |
247
248### CAPTCHA Tools
249
250| Tool | Function |
251|---|---|
252| `cloak_detect_captcha` | Detect CAPTCHA type and sitekey |
253| `cloak_solve_captcha` | Solve via vision AI (Claude/GPT-4o/2captcha) |
254
255### Compression and Cortex Tools
256
257| Tool | Function |
258|---|---|
259| `turbo_quant_compress` | PolarQuant vector compression (~75% smaller) |
260| `turbo_quant_stats` | Compression ratio and savings stats |
261| `cloak_cortex` | Scan project directory into MAGMA entity graph |
262| `cloak_cortex_anatomy` | Get cached file anatomy without rescanning |
263
264### Multimodal Tools
265
266| Tool | Function |
267|---|---|
268| `vektor_text` | Text generation across providers (OpenAI/Claude/Groq/Gemini/NVIDIA NIM) |
269| `vektor_image` | Image generation (DALL-E, Stability, NVIDIA) |
270| `vektor_vision` | Image understanding and analysis |
271| `vektor_speech` | Text-to-speech and transcription |
272| `vektor_search` | Web search with memory integration |
273| `vektor_providers` | List available multimodal providers and status |
274
275### Agent Tools
276
277| Tool | Function |
278|---|---|
279| `vektor_agent_run` | Run autonomous goal executor with memory |
280| `vektor_swarm` | Launch multi-agent swarm task |
281| `vektor_watch` | File system watcher — auto-ingest on change |
282
283---
284
285## Claude Code Setup
286
287Add to `.claude/settings.json` in your project:
288
289```json
290{
291  "mcpServers": {
292    "vektor": {
293      "command": "node",
294      "args": ["/path/to/node_modules/vektor-slipstream/index.js"],
295      "env": {
296        "VEKTOR_LICENCE_KEY": "your-licence-key",
297        "CLOAK_PROJECT_PATH": "/path/to/your/project"
298      }
299    }
300  }
301}
302```
303
304All 44 tools are available in Claude Code via this config.
305
306---
307
308## What's Included
309
310### Memory Core (MAGMA)
311
312- 4-layer associative graph — semantic, causal, temporal, entity
313- MAGMA graph bridge — co-occurrence and temporal edges in SQLite (`vektor-magma-bridge.js`)
314- bge-small-en-v1.5 bi-encoder + ms-marco cross-encoder reranker (`vektor-embedder.js`)
315- BM25 + stemmed BM25 + RRF fusion — keyword + semantic dual-channel recall
316- Persistent entity index — guaranteed named-entity retrieval
317- Foresight extraction — future-tense statements stored with temporal metadata
318- ADD-only contradiction detection — full history preserved, no silent overwrites
319- AUDN curation loop — zero contradictions, zero duplicates
320- REM dream cycle — up to 50:1 compression
321- Sub-1ms recall — local SQLite, no network required
322- Local ONNX embeddings — $0 embedding cost, no API key required
323
324### Integrations
325
326- **Claude Desktop** — DXT extension, 44 tools, auto-memory system prompt
327- **Claude Code** — MCP server, all 44 tools
328- **CLI** — `chat`, `remember`, `ask`, `agent` commands
329- **LangChain** — v1 + v2 adapter included
330- **OpenAI Agents SDK** — drop-in integration
331- **Gemini · Groq · Ollama · NVIDIA NIM** — provider agnostic
332
333---
334
335## Performance
336
337| Metric | Value |
338|---|---|
339| Recall latency | sub-1ms (local SQLite + ONNX) |
340| Embedding cost | $0 — fully local ONNX |
341| Embedding latency | ~10ms GPU / ~25ms CPU |
342| LoCoMo benchmark | 66.9% adjusted judge accuracy |
343| vs Mem0 | beats Mem0 old algorithm (62.47%) |
344| First run | ~2 min (downloads ~25MB model once) |
345| Subsequent boots | <100ms |
346
347## Hardware Auto-Detection
348
349Zero config. VEKTOR detects and uses the best available accelerator:
350
351- **NVIDIA CUDA** — GPU acceleration
352- **Apple Silicon** — CoreML
353- **CPU** — optimised fallback, works everywhere
354
355---
356
357## Environment Variables
358
359| Variable | Default | Purpose |
360|---|---|---|
361| `VEKTOR_SUMMARIZE` | `false` | Enable LLM session summarization on ingest |
362| `VEKTOR_TRIPLES` | `true` | Enable batch triple extraction on ingest |
363| `VEKTOR_FORESIGHT` | `true` | Extract future-tense foresight signals |
364| `VEKTOR_TEMPORAL` | `true` | Enable temporal index and date boosting |
365| `VEKTOR_CONTRADICT` | `true` | Enable ADD-only contradiction detection |
366| `VEKTOR_DEBUG` | — | Enable verbose retrieval debug output |
367| `VEKTOR_MODEL` | `Xenova/bge-small-en-v1.5` | Swap embedding model (e.g. bge-large for higher accuracy) |
368| `VEKTOR_RERANK` | `true` | Enable cross-encoder reranking |
369
370---
371
372## Licence
373
374Commercial licence granted. 
375Monthly fee - all updates included
376
377Solo $9/mo → 3 licences |
378Team $35/mo →  5 licences |
379Studio $59/mo →  10 licences |
380Enterprise $99/mo →  25 licences |
381
382Purchase: [vektormemory.com/product#pricing](https://vektormemory.com/product#pricing)
383Docs: [vektormemory.com/docs](https://vektormemory.com/docs)
384Support: hello@vektormemory.com
385
386---
387
388## Research
389
390Built on peer-reviewed research:
391
392- [MAGMA (arxiv:2601.03236)](https://arxiv.org/abs/2601.03236) — Multi-Graph Agentic Memory Architecture
393- [EverMemOS (arxiv:2601.02163)](https://arxiv.org/abs/2601.02163) — Self-Organizing Memory OS
394- [HippoRAG (arxiv:2405.14831)](https://arxiv.org/abs/2405.14831) — Neurobiologically Inspired Long-Term Memory (NeurIPS 2024)
395- [Mem0 (arxiv:2504.19413)](https://arxiv.org/abs/2504.19413) — Production-Ready Agent Memory
396- [LoCoMo Benchmark](https://arxiv.org/abs/2402.17753) — Long-Context Conversational Memory evaluation
397

Full transparency — inspect the skill content before installing.