Add this skill:

```bash
npx mdskills install doobidoo/mcp-memory-service
```

Comprehensive memory persistence backend with REST API, knowledge graphs, and multi-agent support.
Open-source memory backend for multi-agent systems. Agents store decisions, share causal knowledge graphs, and retrieve context in 5ms — without cloud lock-in or API costs.
Works with LangGraph · CrewAI · AutoGen · any HTTP client · Claude Desktop
| Without mcp-memory-service | With mcp-memory-service |
|---|---|
| Each agent run starts from zero | Agents retrieve prior decisions in 5ms |
| Memory is local to one graph/run | Memory is shared across all agents and runs |
| You manage Redis + Pinecone + glue code | One self-hosted service, zero cloud cost |
| No causal relationships between facts | Knowledge graph with typed edges (causes, fixes, contradicts) |
| Context window limits create amnesia | Autonomous consolidation compresses old memories |
Key capabilities for agent pipelines:

- Framework-agnostic REST API — 15 endpoints, no MCP client library needed
- Knowledge graph — agents share causal knowledge with typed edges
- `X-Agent-ID` header — auto-tag memories by agent identity for scoped retrieval
- `conversation_id` — bypass deduplication for incremental conversation storage

```bash
pip install mcp-memory-service
MCP_ALLOW_ANONYMOUS_ACCESS=true memory server --http
# REST API running at http://localhost:8000
```
```python
import asyncio

import httpx

BASE_URL = "http://localhost:8000"

async def main():
    async with httpx.AsyncClient() as client:
        # Store — auto-tag with X-Agent-ID header
        await client.post(f"{BASE_URL}/api/memories", json={
            "content": "API rate limit is 100 req/min",
            "tags": ["api", "limits"],
        }, headers={"X-Agent-ID": "researcher"})
        # Stored with tags: ["api", "limits", "agent:researcher"]

        # Search — scope to a specific agent (same client session)
        results = await client.post(f"{BASE_URL}/api/memories/search", json={
            "query": "API rate limits",
            "tags": ["agent:researcher"],
        })
        print(results.json()["memories"])

asyncio.run(main())
```
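For incremental conversation saves, the `conversation_id` capability above lets agents opt out of semantic deduplication. A minimal sketch of the store payload; the field name comes from the capability list, but its exact placement in the JSON body is an assumption here:

```python
def incremental_memory_payload(turn: str, conversation_id: str) -> dict:
    """Build a store payload that opts out of semantic deduplication.

    Assumption: conversation_id is passed as a top-level field in the
    /api/memories body; verify against the REST API docs.
    """
    return {
        "content": turn,
        "conversation_id": conversation_id,
        "tags": ["conversation"],
    }

# POST each turn with the same conversation_id so near-duplicate
# incremental updates are all stored, e.g. with the client above:
#   await client.post(f"{BASE_URL}/api/memories",
#                     json=incremental_memory_payload(turn, "conv-42"))
```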
Framework-specific guides: docs/agents/
| | Mem0 | Zep | DIY Redis+Pinecone | mcp-memory-service |
|---|---|---|---|---|
| License | Proprietary | Enterprise | — | Apache 2.0 |
| Cost | Per-call API | Enterprise | Infra costs | $0 |
| Framework integration | SDK | SDK | Manual | REST API (any HTTP client) |
| Knowledge graph | No | Limited | No | Yes (typed edges) |
| Auto consolidation | No | No | No | Yes (decay + compression) |
| On-premise embeddings | No | No | Manual | Yes (ONNX, local) |
| Privacy | Cloud | Cloud | Partial | 100% local |
| Hybrid search | No | Yes | Manual | Yes (BM25 + vector) |
| MCP protocol | No | No | No | Yes |
| REST API | Yes | Yes | Manual | Yes (15 endpoints) |
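The "decay + compression" consolidation row above can be pictured as an exponential-decay relevance score. This is an illustrative sketch only, not the service's actual algorithm; `half_life_days` and the access bonus are invented parameters:

```python
import math

def relevance(age_days: float, access_count: int,
              half_life_days: float = 30.0) -> float:
    """Toy consolidation score: old, rarely accessed memories decay toward 0."""
    # Exponential decay: score halves every half_life_days
    decay = math.exp(-math.log(2) * age_days / half_life_days)
    # Frequent access slows forgetting (logarithmic bonus)
    bonus = math.log1p(access_count)
    return decay * (1.0 + bonus)

fresh = relevance(0, 0)    # base score 1.0
month = relevance(30, 0)   # halved after one half-life: 0.5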
Your AI assistant forgets everything when you start a new chat. After 50 tool uses, context explodes to 500k+ tokens—Claude slows down, you restart, and now it remembers nothing. You spend 10 minutes re-explaining your architecture. Again.
MCP Memory Service solves this.
It automatically captures your project context, architecture decisions, and code patterns. When you start fresh sessions, your AI already knows everything—no re-explaining, no context loss, no wasted time.

Technical showcase: Performance, Architecture, AI/ML Intelligence & Developer Experience
LangGraph · CrewAI · AutoGen · Any HTTP Client · OpenClaw/Nanobot · Custom Pipelines
Claude Code · Gemini Code Assist · Aider · GitHub Copilot CLI · Amp · Continue · Zed · Cody
Claude Desktop · VS Code · Cursor · Windsurf · Raycast · JetBrains · Sourcegraph · Qodo
ChatGPT (Developer Mode) · Claude Web
Works seamlessly with any MCP-compatible client or HTTP client, whether you're building agent pipelines or coding in the terminal, an IDE, or the browser.
💡 NEW: ChatGPT now supports MCP! Enable Developer Mode to connect your memory service directly. See setup guide →
Express Install (recommended for most users):

```bash
pip install mcp-memory-service

# Auto-configure for Claude Desktop (macOS/Linux)
python -m mcp_memory_service.scripts.installation.install --quick
```
What just happened?
Next: Restart Claude Desktop. Your AI now remembers everything across sessions.
📦 Alternative: PyPI + Manual Configuration

```bash
pip install mcp-memory-service
```
Then add to `~/Library/Application Support/Claude/claude_desktop_config.json` (macOS):

```json
{
  "mcpServers": {
    "memory": {
      "command": "memory",
      "args": ["server"]
    }
  }
}
```
🔧 Advanced: Custom Backends & Team Setup
For production deployments, team collaboration, or cloud sync:
```bash
git clone https://github.com/doobidoo/mcp-memory-service.git
cd mcp-memory-service
python scripts/installation/install.py
```
Choose from:
Without MCP Memory Service:

| Session 1 | Session 2 (Fresh Start) |
|---|---|
| You: "We're building a Next.js app with Prisma and tRPC" | AI: "What's your tech stack?" ❌ |
| AI: "Got it, I see you're using App Router" | You: Explains architecture again for 10 minutes 😤 |
| You: "Add authentication with NextAuth" | AI: "Should I use Pages Router or App Router?" ❌ |
With MCP Memory Service:

| Session 1 | Session 2 (Fresh Start) |
|---|---|
| You: "We're building a Next.js app with Prisma and tRPC" | AI: "I remember—Next.js App Router with Prisma and tRPC. What should we build?" ✅ |
| AI: "Got it, I see you're using App Router" | You: "Add OAuth login" |
| You: "Add authentication with NextAuth" | AI: "I'll integrate NextAuth with your existing Prisma setup." ✅ |
Result: Zero re-explaining. Zero context loss. Just continuous, intelligent collaboration.
MCP Memory Service is fully compatible with the SHODH Unified Memory API Specification v1.0.0, enabling seamless interoperability across the SHODH ecosystem.
| Implementation | Backend | Embeddings | Use Case |
|---|---|---|---|
| shodh-memory | RocksDB | MiniLM-L6-v2 (ONNX) | Reference implementation |
| shodh-cloudflare | Cloudflare Workers + Vectorize | Workers AI (bge-small) | Edge deployment, multi-device sync |
| mcp-memory-service (this) | SQLite-vec / Hybrid | MiniLM-L6-v2 (ONNX) | Desktop AI assistants (MCP) |
All SHODH implementations share the same memory schema:
- `emotion`, `emotional_valence`, `emotional_arousal`
- `episode_id`, `sequence_number`, `preceding_memory_id`
- `source_type`, `credibility`
- `quality_score`, `access_count`, `last_accessed_at`

Interoperability Example: Export memories from mcp-memory-service → Import to shodh-cloudflare → Sync across devices → Full fidelity preservation of `emotional_valence`, `episode_id`, and all spec fields.
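A SHODH-style memory record carrying the shared fields above might look like the following. Field values (and the assumed value ranges) are illustrative; consult the spec for exact types:

```python
from datetime import datetime, timezone

# Illustrative record using the shared SHODH schema fields listed above.
memory = {
    "content": "Switched auth to OAuth 2.1 after token-leak incident.",
    # Emotional context
    "emotion": "concern",
    "emotional_valence": -0.4,   # negative..positive, range assumed [-1, 1]
    "emotional_arousal": 0.7,    # calm..activated, range assumed [0, 1]
    # Episodic linkage
    "episode_id": "ep-auth-migration",
    "sequence_number": 3,
    "preceding_memory_id": "mem-0042",
    # Provenance
    "source_type": "conversation",
    "credibility": 0.9,
    # Usage statistics
    "quality_score": 0.85,
    "access_count": 12,
    "last_accessed_at": datetime.now(timezone.utc).isoformat(),
}

spec_fields = {"emotion", "emotional_valence", "emotional_arousal",
               "episode_id", "sequence_number", "preceding_memory_id",
               "source_type", "credibility",
               "quality_score", "access_count", "last_accessed_at"}
assert spec_fields <= memory.keys()
```

Because every implementation shares this schema, a record exported from one backend can be imported by another without dropping fields.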
🧠 Persistent Memory – Context survives across sessions with semantic search
🔍 Smart Retrieval – Finds relevant context automatically using AI embeddings
⚡ 5ms Speed – Instant context injection, no latency
🔄 Multi-Client – Works across 13+ AI applications
☁️ Cloud Sync – Optional Cloudflare backend for team collaboration
🔒 Privacy-First – Local-first, you control your data
📊 Web Dashboard – Visualize and manage memories at http://localhost:8000
🧬 Knowledge Graph – Interactive D3.js visualization of memory relationships 🆕
8 Dashboard Tabs: Dashboard • Search • Browse • Documents • Manage • Analytics • Quality (NEW) • API Docs
📖 See Web Dashboard Guide for complete documentation.
Security: Fix minimatch ReDoS and Replace Abandoned PyPDF2 with pypdf
What's New:
- Upgraded `minimatch` to ^10.2.1 in npm test packages, eliminating a ReDoS attack vector.
- Replaced the abandoned `PyPDF2` with its official successor `pypdf`; no functional change to PDF ingestion.

Previous Releases:
- `MCP_INIT_TIMEOUT` env override, 7 unit tests
- `validate_config()` at startup, `safe_get_int_env()`, 8 new robustness tests
- `conversation_id` support for incremental conversation saves (semantic dedup bypass, metadata storage, all backends)
- `memory server --http` flag (easier UX, single command)

Use `--with-ml` for full ML capabilities. For resource-constrained environments (CI/CD, edge devices):
```bash
pip install mcp-memory-service-lite
```
Benefits:
Trade-offs:
```bash
# For local development/single-user: Enable anonymous access
export MCP_ALLOW_ANONYMOUS_ACCESS=true

# Start HTTP dashboard server (separate from MCP server)
memory server --http

# Access interactive dashboard
open http://127.0.0.1:8000/

# Upload documents via CLI
curl -X POST http://127.0.0.1:8000/api/documents/upload \
  -F "file=@document.pdf" \
  -F "tags=documentation,reference"

# Search document content
curl -X POST http://127.0.0.1:8000/api/search \
  -H "Content-Type: application/json" \
  -d '{"query": "authentication flow", "limit": 10}'
```
⚠️ Authentication Required: The HTTP dashboard requires authentication by default. For local development, set `MCP_ALLOW_ANONYMOUS_ACCESS=true`. For production, use API key authentication (`MCP_API_KEY`) or OAuth. See Configuration for details.
```bash
# Start OAuth-enabled HTTP server for team collaboration
export MCP_OAUTH_ENABLED=true
memory server --http

# Claude Code team members connect via HTTP transport
claude mcp add --transport http memory-service http://your-server:8000/mcp
# → Automatic OAuth discovery, registration, and authentication
```
```bash
# Store a memory
uv run memory store "Fixed race condition in authentication by adding mutex locks"

# Search for relevant memories (hybrid search - default in v10.8.0+)
uv run memory recall "authentication race condition"

# Use hybrid search via HTTP API for exact match + semantic
curl -X POST http://127.0.0.1:8000/api/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "OAuth 2.1 authentication",
    "mode": "hybrid",
    "limit": 10
  }'

# Search by tags
uv run memory search --tags python debugging

# Check system health (shows OAuth status)
uv run memory health
```
Recommended approach - Add to your Claude Desktop config (`~/.claude/config.json`):

```json
{
  "mcpServers": {
    "memory": {
      "command": "python",
      "args": ["-m", "mcp_memory_service.server"],
      "env": {
        "MCP_MEMORY_STORAGE_BACKEND": "sqlite_vec"
      }
    }
  }
}
```
Alternative approaches:

Option 1: UV tooling (if using UV)

```json
{
  "mcpServers": {
    "memory": {
      "command": "uv",
      "args": ["--directory", "/path/to/mcp-memory-service", "run", "memory", "server"],
      "env": {
        "MCP_MEMORY_STORAGE_BACKEND": "sqlite_vec"
      }
    }
  }
}
```

Option 2: Direct script path (v6.17.0+)

```json
{
  "mcpServers": {
    "memory": {
      "command": "python",
      "args": ["/path/to/mcp-memory-service/scripts/server/run_memory_server.py"],
      "env": {
        "MCP_MEMORY_STORAGE_BACKEND": "sqlite_vec"
      }
    }
  }
}
```
Hybrid Backend (v8.9.0+ RECOMMENDED):

```bash
# Hybrid backend with auto-configured pragmas
export MCP_MEMORY_STORAGE_BACKEND=hybrid
export MCP_MEMORY_SQLITE_PRAGMAS="busy_timeout=15000,cache_size=20000"

# Cloudflare credentials (required for hybrid)
export CLOUDFLARE_API_TOKEN="your-token"
export CLOUDFLARE_ACCOUNT_ID="your-account"
export CLOUDFLARE_D1_DATABASE_ID="your-db-id"
export CLOUDFLARE_VECTORIZE_INDEX="mcp-memory-index"

# Enable HTTP API
export MCP_HTTP_ENABLED=true
export MCP_HTTP_PORT=8000

# Security (choose one authentication method)
# Option 1: API Key authentication (recommended for production)
export MCP_API_KEY="your-secure-key"
# Option 2: Anonymous access (local development only)
# export MCP_ALLOW_ANONYMOUS_ACCESS=true
# Option 3: OAuth team collaboration
# export MCP_OAUTH_ENABLED=true
```
SQLite-vec Only (Local):

```bash
# Local-only storage
export MCP_MEMORY_STORAGE_BACKEND=sqlite_vec
export MCP_MEMORY_SQLITE_PRAGMAS="busy_timeout=15000,cache_size=20000"
```
Hybrid Search (v10.8.0+):

```bash
# Enable hybrid BM25 + vector search (default: enabled)
export MCP_HYBRID_SEARCH_ENABLED=true

# Configure score fusion weights (must sum to ~1.0)
export MCP_HYBRID_KEYWORD_WEIGHT=0.3   # BM25 keyword match weight
export MCP_HYBRID_SEMANTIC_WEIGHT=0.7  # Vector similarity weight

# Adjust weights based on your use case:
# - More keyword-focused: 0.5 keyword / 0.5 semantic
# - More semantic-focused: 0.2 keyword / 0.8 semantic
# - Default balanced: 0.3 keyword / 0.7 semantic (recommended)
```
Note: Hybrid search is only available with the `sqlite_vec` and `hybrid` backends. It automatically combines BM25 keyword matching with vector similarity for better exact match scoring while maintaining semantic capabilities.
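The weighted fusion above can be sketched as a convex combination of the two normalized scores. This is illustrative; the service's internal fusion may normalize or rank differently:

```python
def fuse(bm25_score: float, vector_score: float,
         keyword_weight: float = 0.3, semantic_weight: float = 0.7) -> float:
    """Combine normalized BM25 and vector-similarity scores.

    Defaults mirror MCP_HYBRID_KEYWORD_WEIGHT / MCP_HYBRID_SEMANTIC_WEIGHT;
    both input scores are assumed normalized to [0, 1].
    """
    assert abs(keyword_weight + semantic_weight - 1.0) < 1e-6, \
        "weights must sum to ~1.0"
    return keyword_weight * bm25_score + semantic_weight * vector_score

# An exact keyword hit with a weak semantic match still ranks reasonably:
fuse(1.0, 0.2)   # 0.3*1.0 + 0.7*0.2 = 0.44
```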
Control maximum response size to prevent context overflow:
```bash
# Limit response size (recommended: 30000-50000)
export MCP_MAX_RESPONSE_CHARS=50000  # Default: unlimited
```
Applies to all retrieval tools:

- `retrieve_memory`, `recall_memory`, `retrieve_with_quality_boost`
- `search_by_tag`, `recall_by_timeframe`

Behavior:
Use external embedding services instead of running models locally:
```bash
# vLLM example
export MCP_EXTERNAL_EMBEDDING_URL=http://localhost:8890/v1/embeddings
export MCP_EXTERNAL_EMBEDDING_MODEL=nomic-ai/nomic-embed-text-v1.5

# Ollama example
export MCP_EXTERNAL_EMBEDDING_URL=http://localhost:11434/v1/embeddings
export MCP_EXTERNAL_EMBEDDING_MODEL=nomic-embed-text

# OpenAI example
export MCP_EXTERNAL_EMBEDDING_URL=https://api.openai.com/v1/embeddings
export MCP_EXTERNAL_EMBEDDING_MODEL=text-embedding-3-small
export MCP_EXTERNAL_EMBEDDING_API_KEY=sk-xxx
```
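All three services above expose an OpenAI-compatible `/v1/embeddings` endpoint. A minimal stdlib client sketch, reading the env vars from the examples; the request and response shapes follow the OpenAI embeddings API:

```python
import json
import os
import urllib.request

def extract_embeddings(response_body: dict) -> list[list[float]]:
    """Pull vectors out of an OpenAI-style embeddings response body."""
    return [item["embedding"] for item in response_body["data"]]

def embed(texts: list[str]) -> list[list[float]]:
    """POST texts to the endpoint configured by the env vars above."""
    payload = json.dumps({
        "model": os.environ["MCP_EXTERNAL_EMBEDDING_MODEL"],
        "input": texts,
    }).encode()
    req = urllib.request.Request(
        os.environ["MCP_EXTERNAL_EMBEDDING_URL"],
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    if api_key := os.environ.get("MCP_EXTERNAL_EMBEDDING_API_KEY"):
        req.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(req) as resp:
        return extract_embeddings(json.load(resp))
```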
Benefits:
Note: Only supported with the `sqlite_vec` backend. See `docs/deployment/external-embeddings.md` for detailed setup.
```
┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   AI Clients    │    │  MCP Memory     │    │ Storage Backend │
│                 │    │  Service v8.9   │    │                 │
│ • Claude Desktop│◄──►│ • MCP Protocol  │◄──►│ • Hybrid 🌟     │
│ • Claude Code   │    │ • HTTP Transport│    │   (5ms local +  │
│   (HTTP/OAuth)  │    │ • OAuth 2.1 Auth│    │    cloud sync)  │
│ • VS Code       │    │ • Memory Store  │    │ • SQLite-vec    │
│ • Cursor        │    │ • Semantic      │    │ • Cloudflare    │
│ • 13+ AI Apps   │    │   Search        │    │                 │
│ • Web Dashboard │    │ • Doc Ingestion │    │ Zero DB Locks ✅│
│   (Port 8000)   │    │ • Zero DB Locks │    │ Auto-Config ✅  │
└─────────────────┘    └─────────────────┘    └─────────────────┘
```
```
mcp-memory-service/
├── src/mcp_memory_service/  # Core application
│   ├── models/              # Data models
│   ├── storage/             # Storage backends
│   ├── web/                 # HTTP API & dashboard
│   └── server.py            # MCP server
├── scripts/                 # Utilities & installation
├── tests/                   # Test suite
└── tools/docker/            # Docker configuration
```
See CONTRIBUTING.md for detailed guidelines.
- Run `python scripts/validation/validate_configuration_complete.py` to check your setup

Real-world metrics from active deployments:
Important (Version 8.64.0+):
Best practices:
- `scripts/maintenance/` - Auto-retagging and cleanup tools

Install via CLI
MCP Memory Service is a free, open-source AI agent skill. Install it with a single command:

```bash
npx mdskills install doobidoo/mcp-memory-service
```

This downloads the skill files into your project and your AI agent picks them up automatically.
MCP Memory Service works with Claude Code, Claude Desktop, Cursor, VS Code Copilot, Windsurf, Continue.dev, Gemini CLI, Amp, Roo Code, and Goose. Skills use the open SKILL.md format, which is compatible with any AI coding agent that reads markdown instructions.