A graph + vector architecture that research shows approaches human long-term memory performance.
```bash
# Deploy in 60 seconds
railway up

# Or run locally
make dev
```

Persistent memory for your AI.
AI assistants forget everything between sessions. RAG systems retrieve context but can't learn patterns. Vector databases find similar text but miss relationships.
You need AI that actually remembers.
AutoMem is a graph-vector memory service that gives AI assistants durable, relational memory:
- 🧠 Stores memories with rich metadata, importance scores, and temporal context
- 🔍 Recalls with hybrid search - vector similarity + keyword + tags + time
- 🔗 Builds knowledge graphs - 11 relationship types between memories
- 🎯 Learns patterns - automatic entity extraction, clustering, and consolidation
- ⚡ Sub-second recall - even with millions of memories
AutoMem implements principles from:
- HippoRAG 2 (2025): Graph-vector hybrid for human-like associative memory
- A-MEM (2025): Dynamic memory organization with Zettelkasten principles
- MELODI (DeepMind, 2024): 8x memory compression without quality loss
- ReadAgent (DeepMind, 2024): 20x context extension through gist memories
```
┌─────────────────────────────────────────────┐
│          AutoMem Service (Flask)            │
│  • REST API for memory lifecycle            │
│  • Background enrichment pipeline           │
│  • Consolidation engine                     │
│  • Automated backups (optional)             │
└──────────────┬──────────────┬───────────────┘
               │              │
        ┌──────▼──────┐  ┌────▼───────┐
        │  FalkorDB   │  │   Qdrant   │
        │   (Graph)   │  │  (Vectors) │
        │             │  │            │
        │ • 11 edge   │  │ • Semantic │
        │   types     │  │   search   │
        │ • Pattern   │  │ • 768-d    │
        │   nodes     │  │   vectors  │
        └─────────────┘  └────────────┘
```
FalkorDB (graph) = canonical record, relationships, consolidation
Qdrant (vectors) = semantic recall, similarity search
Dual storage = Built-in redundancy and disaster recovery
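The dual-write flow can be sketched with in-memory stand-ins (illustrative stubs, not the real FalkorDB/Qdrant clients); the key point is that both stores share one memory id, so vector hits can always be joined back to the canonical graph record:

```python
import uuid

class GraphStore:                     # stand-in for FalkorDB
    def __init__(self):
        self.nodes = {}
    def create(self, mid, content):   # canonical record + future edges
        self.nodes[mid] = {"content": content}

class VectorStore:                    # stand-in for Qdrant
    def __init__(self):
        self.points = {}
    def upsert(self, mid, vector):    # same id joins results back to graph
        self.points[mid] = vector

def store_memory(graph, vectors, content, embedding):
    mid = str(uuid.uuid4())
    graph.create(mid, content)
    vectors.upsert(mid, embedding)
    return mid

graph, vectors = GraphStore(), VectorStore()
mid = store_memory(graph, vectors, "Chose PostgreSQL", [0.1, 0.2])
```

Because each store holds a full copy keyed by the same id, losing either one leaves enough data to rebuild the other.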
Vector-only search:

```
Memory: "Chose PostgreSQL for reliability"
Query:  "What database should I use?"
Result: ✅ Finds the memory
        ❌ Doesn't know WHY you chose it
        ❌ Can't connect to related decisions
```

With AutoMem:

```
Memory: "Chose PostgreSQL for reliability"
Graph:  PREFERS_OVER MongoDB
        RELATES_TO "team expertise" memory
        DERIVED_FROM "boring technology" principle
Query:  "What database should I use?"
Result: ✅ Finds the memory
        ✅ Knows your decision factors
        ✅ Shows related preferences
        ✅ Explains your reasoning pattern
```
```
# After storing: "Migrated to PostgreSQL for operational simplicity"
AutoMem automatically creates:
├── Entity: PostgreSQL (tagged: entity:tool:postgresql)
├── Entity: operational simplicity (tagged: entity:concept:ops-simplicity)
├── Pattern: "prefers boring technology" (reinforced)
├── Temporal: PRECEDED_BY migration planning memory
└── Similarity: SIMILAR_TO "Redis deployment" (both value simplicity)

# Next query: "Should we use Kafka?"
AI recalls:
- Your PostgreSQL decision
- Your "boring tech" pattern
- Related simplicity preferences
→ Suggests: "Based on your operational simplicity pattern,
             consider RabbitMQ instead"
```

```
# Semantic + keyword + tags + time + importance scoring
GET /recall?query=database&tags=decision&time_query=last%20month

Returns memories ranked by:
- Vector similarity   (0.64)
- Tag match           (0.50)
- Recency             (0.90)
- Exact keyword match (1.00)

Final score: 0.82 (weighted combination)
```

- Store - Rich memories with metadata, importance, timestamps, embeddings
- Recall - Hybrid search (vector + keyword + tags + time windows)
- Update - Modify memories, auto-regenerate embeddings
- Delete - Remove from both graph and vector stores
- Associate - Create typed relationships between memories
- Filter - Tag-based queries with prefix/exact matching
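The hybrid recall scoring described above can be sketched as a weighted sum; the weights here are illustrative (chosen to reproduce the 0.82 example), not AutoMem's actual values:

```python
# Hypothetical hybrid-scoring sketch; weights are illustrative,
# not AutoMem's actual configuration.
def hybrid_score(vector_sim, tag_match, recency, keyword_match,
                 weights=(0.30, 0.10, 0.25, 0.35)):
    """Weighted combination of per-signal scores, each in [0, 1]."""
    signals = (vector_sim, tag_match, recency, keyword_match)
    return sum(w * s for w, s in zip(weights, signals))

score = hybrid_score(vector_sim=0.64, tag_match=0.50,
                     recency=0.90, keyword_match=1.00)
print(round(score, 2))  # → 0.82
```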
AutoMem uses dream-inspired consolidation cycles to keep memories fresh and useful:
- Decay (Hourly): Exponential relevance scoring based on age, access, relationships, and importance
- Creative (Hourly): Discovers non-obvious connections between memories (REM-like processing)
- Cluster (6hrs): Groups similar memories and creates meta-patterns
- Forget (Daily): Archives low-relevance memories, deletes very old unused ones
Memories aren't deleted immediately - they're archived first (relevance 0.05-0.2), only removed if they drop below 0.05. Wrong rabbit holes fade naturally (~30-45 days without use). Important connected memories stay indefinitely.
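A minimal sketch of this decay behavior, with invented constants rather than AutoMem's exact formula: relevance falls exponentially with age and is boosted by importance, access count, and graph links, so unused isolated memories drop below the 0.05 deletion threshold while important connected ones persist.

```python
# Illustrative relevance-decay sketch (constants invented for this
# example; AutoMem's actual scoring formula may differ).
def relevance(age_days, importance, accesses, links, half_life=30.0):
    base = 0.5 ** (age_days / half_life)          # exponential age decay
    boost = 1.0 + 0.1 * accesses + 0.05 * links   # usage/connection boost
    return min(1.0, base * importance * boost)

fresh = relevance(age_days=1, importance=0.9, accesses=3, links=4)
stale = relevance(age_days=45, importance=0.1, accesses=0, links=0)
# fresh stays at 1.0; stale falls below the 0.05 deletion threshold
```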
Automatically enhances every memory:
- Entity extraction - People, projects, tools, concepts (with spaCy)
- Auto-tagging - `entity:<type>:<slug>` for structured queries
- Summaries - Lightweight snippets for quick scanning
- Temporal links - `PRECEDED_BY` to recent memories
- Semantic neighbors - `SIMILAR_TO` via cosine similarity
- Pattern detection - Reinforces emerging themes
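The semantic-neighbor step can be sketched as a cosine-similarity threshold check; the 0.8 cutoff below is illustrative (the real threshold is tunable via the `ENRICHMENT_*` settings):

```python
import math

# Sketch of SIMILAR_TO linking: connect the new memory to stored
# neighbors whose embedding similarity clears a (tunable) threshold.
def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def similar_to(new_vec, stored, threshold=0.8):
    """Ids of stored memories that get a SIMILAR_TO edge."""
    return [mid for mid, vec in stored.items()
            if cosine(new_vec, vec) >= threshold]

links = similar_to([1.0, 0.0], {"m1": [0.9, 0.1], "m2": [0.0, 1.0]})
# links == ["m1"]; the orthogonal m2 is not linked
```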
Keeps memory fresh over time:
- Decay (hourly) - Exponential relevance scoring
- Creative (hourly) - Discovers surprising associations
- Cluster (6-hourly) - Groups similar embeddings, generates meta-memories
- Forget (daily) - Archives/deletes low-relevance memories
Build rich knowledge graphs:
| Type | Use Case | Example |
|---|---|---|
| `RELATES_TO` | General connection | Bug report → Related issue |
| `LEADS_TO` | Causal relationship | Problem → Solution |
| `OCCURRED_BEFORE` | Temporal sequence | Planning → Execution |
| `PREFERS_OVER` | User preferences | PostgreSQL → MongoDB |
| `EXEMPLIFIES` | Pattern examples | Code review → Best practice |
| `CONTRADICTS` | Conflicting info | Old approach → New approach |
| `REINFORCES` | Supporting evidence | Decision → Validation |
| `INVALIDATED_BY` | Outdated info | Legacy docs → Current docs |
| `EVOLVED_INTO` | Knowledge evolution | Initial design → Final design |
| `DERIVED_FROM` | Source tracking | Implementation → Spec |
| `PART_OF` | Hierarchical structure | Feature → Epic |
Deploy AutoMem + FalkorDB to Railway in 60 seconds:
```bash
# Install Railway CLI
npm i -g @railway/cli

# Deploy
railway login
railway init
railway up
```

👉 Deployment Guide for detailed Railway setup
Run everything locally:
```bash
# Clone and start services
git clone https://github.com/verygoodplugins/automem.git
cd automem
make dev

# API: http://localhost:8001
# FalkorDB: localhost:6379
# Qdrant: localhost:6333
```

Run the API without Docker:

```bash
python3 -m venv venv
source venv/bin/activate
pip install -r requirements-dev.txt
PORT=8001 python app.py
```

Store a memory:

```bash
curl -X POST http://localhost:8001/memory \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "content": "Chose PostgreSQL over MongoDB for ACID compliance",
    "type": "Decision",
    "confidence": 0.95,
    "tags": ["database", "architecture"],
    "importance": 0.9,
    "metadata": {
      "source": "architecture-meeting",
      "alternatives": ["MongoDB", "MySQL"],
      "deciding_factors": ["ACID", "team_expertise"]
    }
  }'
```

Available memory types: Decision, Pattern, Preference, Style, Habit, Insight, Context (default)
- Explicit `type` recommended when you know the classification
- Omit `type` to let enrichment auto-classify from content
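A toy illustration of auto-classification: the keyword heuristics below are invented for this sketch, and the real enrichment pipeline classifies content far more robustly.

```python
# Illustrative only - heuristics invented for this sketch, not
# AutoMem's actual enrichment logic.
MEMORY_TYPES = ("Decision", "Pattern", "Preference", "Style",
                "Habit", "Insight", "Context")

def classify(content):
    text = content.lower()
    if "chose" in text or "decided" in text:
        return "Decision"
    if "prefer" in text:
        return "Preference"
    return "Context"  # Context is the documented default type

print(classify("Chose PostgreSQL over MongoDB"))  # → Decision
```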
```http
# Hybrid search with tags and time
GET /recall?query=database&tags=decision&time_query=last%20month

# Semantic search with vector
GET /recall?embedding=0.12,0.56,...&limit=10

# Tag prefix matching (finds slack:U123:*, slack:channel-ops, etc.)
GET /recall?tags=slack&tag_match=prefix
```

Create an association:

```bash
curl -X POST http://localhost:8001/associate \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "memory1_id": "uuid-postgres-decision",
    "memory2_id": "uuid-mongodb-evaluation",
    "type": "PREFERS_OVER",
    "strength": 0.9
  }'
```

AutoMem works with any AI platform via:
Local MCP Bridge (Claude Desktop, Cursor, Claude Code):
```bash
# Install official MCP bridge
npm install -g @verygoodplugins/mcp-automem

# Configure for local AI tools
npx @verygoodplugins/mcp-automem setup
```

SSE Sidecar (Cloud AI Platforms):
Connect AutoMem to cloud AI platforms via HTTPS. Works with:
- ChatGPT (requires developer mode)
- Claude.ai web interface
- Claude mobile app
- ElevenLabs Agents
See MCP over SSE documentation for setup instructions.
👉 Resources:
- NPM bridge (local): https://www.npmjs.com/package/@verygoodplugins/mcp-automem
- SSE setup guide: docs/MCP_SSE.md
Any language, any framework:
```python
import requests

response = requests.post(
    "https://your-automem.railway.app/memory",
    headers={"Authorization": f"Bearer {token}"},
    json={"content": "Memory content", "importance": 0.8},
)
```

- ✅ Relationships - Not just "similar", but "causes", "prefers", "invalidates"
- ✅ Temporal awareness - Knows what came before, what evolved from what
- ✅ Pattern learning - Discovers themes across memories
- ✅ Consolidation - Memories improve over time, not just accumulate
- ✅ Structured relationships - 11 edge types vs cosine similarity only
- ✅ Background intelligence - Auto-enrichment, clustering, decay
- ✅ Hybrid scoring - Vector + keyword + tags + time + importance
- ✅ Knowledge graphs - Traverse relationships, not just retrieve vectors
- ✅ Research-validated - Implements HippoRAG 2, A-MEM, MELODI principles
- ✅ Production-ready - Authentication, admin tools, health monitoring
- ✅ Battle-tested - Enrichment pipeline, consolidation, retry logic
- ✅ Open source - MIT license, deploy anywhere
- Sub-second recall - Even with 100k+ memories
- Concurrent writes - Background enrichment doesn't block API
- Graceful degradation - Works without Qdrant (graph-only mode)
- Automatic retries - Failed enrichments queue for reprocessing
- Health monitoring - `/health` and `/enrichment/status` endpoints
- Automated backups - Optional backup service for disaster recovery
- Dual storage - Data in both FalkorDB and Qdrant provides redundancy
- Benchmark validated - Test against LoCoMo (ACL 2024) with `make test-locomo`
- `AUTOMEM_API_TOKEN` - Authentication for all endpoints (except `/health`)
- `FALKORDB_HOST` / `FALKORDB_PORT` - Graph database connection
- `QDRANT_URL` / `QDRANT_API_KEY` - Enable semantic search
- `OPENAI_API_KEY` - Real embeddings (otherwise deterministic placeholders)
- `ADMIN_API_TOKEN` - Required for `/admin/reembed` and enrichment controls
- Consolidation tuning: `CONSOLIDATION_*_INTERVAL_SECONDS`
- Enrichment tuning: `ENRICHMENT_*` (similarity threshold, retry limits, etc.)
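Putting the variables together, a minimal local `.env` might look like this (values illustrative; hosts and ports match the local-development defaults above):

```ini
AUTOMEM_API_TOKEN=replace-with-a-long-random-token
ADMIN_API_TOKEN=replace-with-a-different-token
FALKORDB_HOST=localhost
FALKORDB_PORT=6379
QDRANT_URL=http://localhost:6333
OPENAI_API_KEY=sk-your-key
```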
- 📦 Installation Guide - Railway, Docker, development setup
- 🌉 MCP over SSE Sidecar - Expose AutoMem as an MCP server over SSE for ChatGPT/ElevenLabs
- 💾 Monitoring & Backups - Health monitoring and automated backups
- 🔧 API Reference - All endpoints with examples
- 🧪 Testing Guide - Unit, integration, live server, and LoCoMo benchmark tests
- 📊 LoCoMo Benchmark - Validate against ACL 2024 long-term memory benchmark
- 🔄 Migration Guide - Move from MCP SQLite
- 🌐 automem.ai - Official website and guides
- 🌐 automem.ai - Official website
- 🐙 GitHub - Source code
- 📦 NPM MCP Bridge - MCP integration
- 🐛 Issues - Bug reports and feature requests
AutoMem's architecture is validated by peer-reviewed research:
HippoRAG 2 (Ohio State, June 2025)
Proves graph-vector hybrid achieves 7% better associative memory than pure vector RAG, approaching human long-term memory performance.
A-MEM (July 2025)
Validates dynamic memory organization with Zettelkasten-inspired principles - exactly what AutoMem's pattern detection and clustering implement.
MELODI (DeepMind, 2024)
Shows 8x memory compression without quality loss through gist representations - AutoMem's summary generation follows these principles.
ReadAgent (DeepMind, 2024)
Demonstrates 20x context extension via episodic memory - AutoMem's consolidation engine implements similar temporal organization.
We didn't just read the papers - we built the system they describe.
We welcome contributions! Please:
- Fork the repository
- Create a feature branch
- Add tests for your changes
- Submit a pull request
See TESTING.md for running the test suite.
MIT - Because AI memory should be free.
Ready to give your AI human-like memory?
```bash
railway up
```

Built with obsession. Validated by neuroscience. Powered by graph theory.
Transform AI from a tool into a thinking partner. Deploy AutoMem now.