Context Backend for Modern Apps
A high-performance memory engine that sits alongside your database. Use it to reduce token usage for LLMs and inject associative context into your search results.
$ pip install cuemap
$ npm install cuemap
$ docker run -p 8080:8080 cuemap/engine:latest
Engine: AGPLv3 • SDKs: MIT • Commercial license available
The difference between fuzzy matching and precise truth.
Vector search retrieves fuzzy matches: for every relevant fact, you pay for chunks of noise.
CueMap intersects Context + Time + Habit. You pay only for grounded context. Precise. Deterministic.
Cut output token usage by up to 90% by injecting only high-signal memories.
Feed your LLM verifiable, deterministic context via the Grounded Recall API.
Provide the "needle in the haystack" without the haystack.
Turn vague queries into precise results.
CueMap's cue co-occurrence graph and self-learning lexicon expand user intent before it hits your database.
Your database returns only the Stripe integration logs with timeout errors. Millions of unrelated logs filtered out instantly.
Use CueMap's Context Expansion API to expand user intent before querying your database.
Track user interests in CueMap's graph to re-rank results from your main database based on individual context.
Bridge the gap between vague user queries and precise database records with explainable, auditable expansions.
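Put together, the expansion-then-re-rank flow might look like the sketch below; the client, `expand()`, and `rerank()` names are assumptions for illustration, not the documented API:

```python
# Illustrative sketch only: the cuemap client, expand(), and rerank()
# are assumptions for this example, not the documented API surface.
from cuemap import CueMap  # hypothetical Python client

def db_search(cues):
    """Stand-in for your existing database query layer."""
    return [f"log row mentioning {c}" for c in cues]

cm = CueMap("http://localhost:8080")

# 1. Expand vague user intent into explicit, auditable cues.
expansion = cm.expand(query="payment problems")

# 2. Query your own database with the expanded cue set.
rows = db_search(expansion.cues)

# 3. Re-rank results against this user's interest graph.
ranked = cm.rerank(rows, user_id="u_123")
```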
Biologically inspired. Mathematically continuous.
The Context Filter
Unlike vector search which finds "neighbors," CueMap intersects cues directly. More cues → smaller candidate set → higher precision.
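The principle is ordinary set intersection. A toy illustration (not the engine's actual data structures):

```python
# Toy illustration of cue intersection (not the engine's internals):
# each cue maps to the set of memories it appears in, and adding cues
# can only shrink the candidate set.
index = {
    "stripe":  {1, 2, 3, 7, 9},
    "timeout": {2, 3, 5, 8},
    "webhook": {3, 7, 8},
}

candidates = index["stripe"] & index["timeout"]  # {2, 3}
candidates = candidates & index["webhook"]       # {3}
# More cues -> smaller candidate set -> higher precision.
```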
The Freshness Gradient
Just as biological signals fade over distance, memory relevance decays over time as 1/(1+t). CueMap models this with a continuous gradient, so recent events naturally overpower old habits.
The Weight of Habit
"Cells that fire together, wire together." Every recall event physically strengthens the memory's signal. Critical knowledge gains massive "mass," preventing it from being displaced by trivial events.
Know exactly why every result was returned. No black boxes.
Vector search: great for similarity, hard to audit.
Good for fuzzy matching, but hard to explain why one chunk ranked above another. Time and policy versioning are usually bolted on.
Match Integrity: Proof of Work
Every score is decomposable. Every source is verifiable. Audit-ready.
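What a decomposable result could look like on the wire; the field names in this sketch are illustrative assumptions, not the actual response schema:

```python
# Illustrative only: field names are assumptions, not the actual schema.
# The point: every result can carry the terms that produced its score.
result = {
    "memory_id": "mem_8431",
    "score": 2.40,
    "breakdown": {
        "matched_cues": ["stripe", "timeout", "webhook"],  # why it matched
        "habit_weight": 6.0,   # reinforcement from prior recalls
        "freshness": 0.40,     # 1 / (1 + t) at query time
    },
    "source": "logs/2024-11-02/payments.log",  # verifiable origin
}
assert result["breakdown"]["matched_cues"]  # nothing ranks without a reason
```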
Bootstrap cues automatically. Let the engine learn your context.
Watches your codebase and files in real-time. Supports most popular file formats out of the box. Automatically extracts cues using high-precision parsers.
A self-learning map of your vocabulary. Because the Lexicon runs on its own internal CueMap engine, it uses biological reinforcement to automatically disambiguate words based on how you actually use them.
CueMap surfaces synonyms and related terms from your usage patterns automatically. No manual synonym mapping is required; the engine discovers relationships on its own.
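As a toy model of usage-driven disambiguation (a sketch of the idea, not the Lexicon's implementation):

```python
# Toy sketch of usage-driven disambiguation (not the Lexicon's internals).
from collections import Counter

# Count which cues co-occur with "python" across recalled memories.
cooccurrence = Counter()
usage = [
    ["python", "pip", "venv"],
    ["python", "asyncio", "pip"],
    ["python", "snake", "zoo"],
]
for memory in usage:
    for cue in memory:
        if cue != "python":
            cooccurrence[cue] += 1

# Reinforced associations win: here "python" leans programming, not reptiles.
print(cooccurrence.most_common(2))  # [('pip', 2), ('venv', 1)]
```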
Don't spend weeks defining synonyms. CueMap comes pre-loaded with WordNet for instant English associations.
Coming Soon: The Public Context API. Instantly inherit millions of global associations, from pop culture to deep tech, without training a single byte.
Real benchmarks on real Wikipedia data. No marketing fluff.
Ingestion latency: time to parse a raw sentence and extract its semantic cues.
| Dataset Scale (memories) | Avg Latency | P50 (Median) | Throughput | Scaling |
|---|---|---|---|---|
| 10,000 | 2.29 ms | 1.91 ms | ~436 ops/s | — |
| 100,000 | 2.06 ms | 2.08 ms | ~327 ops/s | 🟢 Flat |
| 1,000,000 | 2.34 ms | 2.00 ms | ~427 ops/s | 🟢 O(1) |
Observation: Ingestion latency is effectively O(1). Scaling from 10k → 1M memories adds no latency penalty (P50 holds at ~2.00 ms).
Recall latency: time to parse a query, perform pattern completion (context expansion), and intersect the semantic graph.
| Dataset Scale | Operation | Avg Latency | P50 (Median) | P99 (Tail) |
|---|---|---|---|---|
| 100,000 | Smart Recall (with PC) | 5.17 ms | 4.25 ms | 13.08 ms |
| 100,000 | Raw Recall (no PC) | 4.33 ms | 4.19 ms | 11.10 ms |
| 1,000,000 | Smart Recall (with PC) | 11.57 ms | 10.97 ms | 26.17 ms |
| 1,000,000 | Raw Recall (no PC) | 7.18 ms | 6.65 ms | 15.82 ms |
What is Smart Recall? Pattern Completion (PC) automatically expands your query with inferred cues from the co-occurrence graph. Even with PC enabled, a 1M-item search (10.97 ms median) finishes faster than a single 60 Hz screen refresh (~16.7 ms).
Observation: 10x more data = only 2x more latency. Sub-linear scaling means you can grow without fear.
Benchmark Setup: Machine: Apple M1 Max, 64GB RAM | Engine: single-node Rust server | Dataset: real Wikipedia articles, full NLP pipeline | Workload: 5–10 cues per memory. See full methodology →
CueMap excels at multi-dimensional, time-aware queries where explicit cue intersection matters more than semantic similarity.
Not a toy. Not a prototype. A production-grade memory engine.
Isolate & Correlate
Partition data into logical namespaces (e.g., news, stocks, logs) for strict isolation. Then, perform Cross-Namespace Unions instantly to find correlations between siloed datasets.
# Start the engine in multi-tenant mode for strict project isolation
./cuemap --multi-tenant
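Cross-namespace reads could then look like this sketch; the Python client and parameter names are illustrative assumptions, not the documented API:

```python
# Illustrative sketch: client and parameter names are assumptions,
# not the documented cuemap API surface.
from cuemap import CueMap  # hypothetical Python client

cm = CueMap("http://localhost:8080")

# Writes stay strictly isolated per namespace...
cm.remember("Fed raises rates 25bps", namespace="news")
cm.remember("NVDA drops 4% after hours", namespace="stocks")

# ...while reads can union namespaces to surface cross-silo correlations.
hits = cm.recall(
    cues=["rates", "nvda"],
    namespaces=["news", "stocks"],  # cross-namespace union
)
```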
Background Snapshots
State is automatically serialized to disk using Bincode, a compact binary format. Near-zero-latency snapshots happen in the background, with instant recovery on restart.