COGNITIVEX · COMPARE
Zep pricing, explained
Zep meters on Episodes, every message you write into memory. CognitiveX meters on recall, and lets you store for free. Here's how each bill actually adds up, and which model fits your agent's shape.
THE SHORT ANSWER
Zep bills you to remember. We bill you to recall.
Zep prices on Episodes. An Episode is, roughly, a unit of input you feed into memory, a message or event ingested into a user or session graph, from which Zep extracts facts and maintains its temporal knowledge graph. You start with a free allotment of credits, then move onto paid plans and buy more credit as you scale. The meter ticks on the write. That makes Zep's pricing easy to start and easy to under-estimate: the cost driver is how much your agents talk, not how often you actually use what they remembered.
CognitiveX inverts that. Storing memories is free; you spend recall credits only when you read, and those are priced by depth: a Foundational recall costs 1, a Standard recall 3, a Deep recall 10. Plans are flat: Amnesiac (free), Awakened at $20/mo, and Conscious at $200/mo, with pay-as-you-go overflow and a spend cap you set. The decision rule is simple: is your agent write-heavy or read-heavy?
SIDE BY SIDE
Zep pricing vs CognitiveX, line by line
| Dimension | Zep | CognitiveX |
|---|---|---|
| What you're billed on | Per Episode: each message / event written to memory | Per recall: reads, by depth. Storing memories is free. |
| Writing / ingestion | Metered; every Episode you add consumes credits | Free; store as much as your plan's memory cap allows |
| Reading / retrieval | Included with the Episode model; graph search on top | Metered: Foundational 1 · Standard 3 · Deep 10 credits |
| Free tier | A starter allotment of credits, then paid plans | Amnesiac: 100 recall credits/mo · 1k memories |
| Paid entry | Usage- / seat-based plans above the free credits | Awakened: $20/mo ($199/yr) · 500k credits · 1M memories |
| Top tier | Enterprise / volume pricing on request | Conscious: $200/mo ($1,999/yr) · 15M credits · unlimited |
| Overflow | Buy more Episode credits as you scale | Pay-as-you-go metered, with a spend cap you set |
| Cost driver to watch | Write volume; chatty agents burn Episodes fast | Recall depth; Deep recalls cost more than shallow ones |
Zep's exact published numbers change over time, so check their site for the current credit allotments. The CognitiveX figures here are our live plans. What's structural, and won't change, is which event the meter watches: Zep watches writes, CognitiveX watches recalls.
DO THE MATH
Which model is cheaper for your agent?
The honest answer depends on your read/write ratio, so reason about your own traffic rather than a headline price:
- Write-heavy, read-light. A logging agent, an ingestion pipeline, a chatty support bot that records everything and queries rarely. Per-Episode metering taxes exactly the thing you do most. Here a model where storage is free and you only pay on recall tends to win.
- Read-heavy, write-light. A research assistant that holds a stable corpus and queries it constantly. Recall-metered pricing has to be sized for that read volume, so pick your plan by expected recalls, and favor Standard depth over Deep where Deep isn't earning its 10 credits.
- Balanced. Compare like-for-like: estimate Episodes/month for Zep and recalls/month (by depth) for CognitiveX, then map each onto its plan or pay-as-you-go rate. Don't forget that under CognitiveX a growing memory store costs nothing until you read it back.
BEYOND PRICE · THE LCM
Pricing is a proxy. Ask what you're paying for.
A per-Episode meter and a per-recall meter both bill access to storage and retrieval. CognitiveX charges for recall because recall is where the work happens, but the memory you're recalling from isn't a passive store. It's the Large Cognition Model (LCM), where the memory is the model.
Concretely, a recall draws on a four-tier memory (semantic, episodic, procedural, foundational), with pattern detection and salience scoring deciding what surfaces, overnight dream consolidation that compresses and links memories while the system is idle, and reflection / introspection so the memory has a view of itself. An LLM does query → model → response → forget; the LCM closes the loop: query → living memory → reasoning → learning → evolution. It speaks MCP, so any agent (and any swappable LLM) recalls through it. That's why depth is a real pricing axis: a Deep recall is doing more cognition, not just pulling more rows.
GET STARTED
Store for free. Pay when you recall.
This is cognitive infrastructure that makes every LLM smarter, not another LLM wrapper. Plug it in over MCP, keep your model swappable, and never get taxed for remembering.
See the full plans on pricing · switching off Zep? Read the Zep alternative · or browse every comparison.