You and your team's decisions, carried forward by your AI agents.
From your conversations with AI, agent work, external sources, and more, structure and accumulate your and your team's decisions, state, and relationships — organized and kept current automatically. The memory platform for the age of AI agents, turning them into knowledge assets the next agent can use right away.
We'll send your invite once it's ready. If requests are high, there may be a wait — thank you for your patience.
Struggling with AI memory?
The more you use AI, the more you re-explain from scratch — sound familiar?
If you use Claude, ChatGPT, Cursor, or Gemini every day, you keep hitting three walls.
01
Memory keeps growing — but it doesn't cross tools
Even as ChatGPT and Claude add memory, it stays inside each tool. Switch tools or projects and you often re-explain your structure and past decisions all over again.
Do I really have to re-explain to Cursor what I decided in Claude?
02
Your knowledge never reaches the team
The design decisions and research you built up with AI live only in your own chat history. Teammates can't see them, and if you're away, that knowledge disappears entirely.
Only they know that spec — guess I'm researching it again.
03
The longer the context, the higher the cost
Pasting your README, specs, and past threads every time inflates the token count and your bill — and you still can't tell how much got through.
I pasted so much context my API bill doubled.
SOLVED WITH Kagura AI
With Kagura AI, your decisions and your agents' work all become memory.
The more you use it, the more Kagura AI organizes automatically — growing into a knowledge base your whole team can use.
Step 01
Capture automatically
Capture your AI conversations, agent activity, files, and external sources via MCP, REST API, or auto-sync. You never have to decide what to keep.
Step 02
Organize automatically
It merges duplicates, links related memories, and fixes stale or conflicting information. The more you use it, the more accurate your knowledge base becomes.
Step 03
Share across AIs
Share the same memory across multiple AIs like Claude, Cursor, Codex, and Gemini. With MCP, CLI, and REST API support, continue with the same context from any tool.
Step 04
Share with your team
One person's insight becomes the whole team's asset. Configure which memories are shared and who can see them, by role.
FOR WHOM
If you feel like you explain the same things to AI or your team every time, this is for you
Whatever your AI experience level, Kagura AI helps in situations like these.
Individual
Individual engineers who use AI tools daily
If you write code with Claude Code or Cursor: stop re-describing your project on every new chat, and stop re-teaching last week's bug fix to the AI.
Teams
Dev teams using AI together
Your team uses ChatGPT or Claude, but no one can see who decided what. No more digging through Slack to find how a bug was handled — the whole team uses AI well.
AI beginners
Non-engineers just starting with AI
If you're unsure what to tell the AI or never feel like you're using it well: context accumulates automatically from past conversations, so you stop re-explaining the basics.
Rollout
People driving AI adoption company-wide
Want to roll out ChatGPT but worried about leaks, or teams using it in silos? See who uses what, and separate accessible information by department.
Agencies
SIers & contractors juggling multiple projects
Keep client A's and B's requirements from blurring together. Separate memory per client and reuse past proposals and design decisions on the next job.
Agent dev
Developers orchestrating multiple AI agents
Want to pass what agent A found to agent B, or share one context across agents? Embed Kagura AI as a shared memory all your agents can reference.
FEATURE LIST
Built from two products
A combination of Cloud, which saves and retrieves memory, and Worker, which organizes and grows it automatically.
CLOUD
Kagura Memory Cloud
Available from the Trial plan
A cloud home for your AI's memory
Just send work notes and decisions from AI tools like Claude, Cursor, or ChatGPT — they're organized and stored automatically. Connect via MCP, CLI, or REST API, ready to recall from the next session or another tool.
Save memoryremember
Hand over a conversation or note and it decides what matters, then organizes and stores it. No need to think about what or how to save.
Example Just tell Claude: 'Save today's bug-fix notes to Kagura.'
Recall memoryrecall
Pull up related memories from vague phrases like 'that thing last week' or 'the login design decision'. It searches by meaning, even when keywords don't match exactly.
Example 'Did we decide anything about auth before?' — instant hit.
Connected memoryGraph
Automatically links the relationships between saved memories. It records context like 'this bug came from that design decision', so you can pull related info together.
Example Recall 'memory A' and related 'B and C' come with it.
Per-project memoryContext
Keep memory separated by project or client. No more mixing up client A with client B — hand the AI only the memory that's relevant.
Example Split Contexts for projects A and B, managed as independent memory.
Connect from multiple tools & pathsMCP / CLI / API
Access the same memory from different AI tools like Claude, Cursor, Gemini, and Codex. Connect not only via MCP but also CLI and REST API — the same context works from any tool, any method.
Example Reference a decision saved in Cursor from Claude, the CLI, or the API.
Access control & historyGovern
Set who can see which memory. For team use, control things like 'this memory is mine only' or 'this project is open to the whole team'. Operation history is kept too.
Example Limit confidential project memory to the assignee only.
Capture & syncCapture
Bring in conversations, files, and external sources as memory — send them manually or pull them in via auto-sync. Consolidate scattered information in one place.
Example Import meeting notes and GitHub discussions into Kagura at once.
Hybrid searchHybrid Search
Combines keyword (BM25), semantic (vector), and relational (graph) search, so even vague questions reach the right memory.
Example Half-remembered wording still surfaces the related memories.
Web dashboardDashboard
Search, review, and edit your saved memory from a browser screen. No code needed to see and organize the whole picture.
Example Browse all memories on screen and clean up what you don't need.
WORKER
Kagura Memory Worker
Available on Starter and above
Organizes and grows your memory automatically
Left alone, memory fills up with stale info and duplicates. Worker reviews it on a schedule in the background and keeps it in a usable state.
Auto cleanupSleep Maintenance
Periodically checks all memory and merges or removes entries that are stale, duplicated, or contradictory. No manual upkeep required.
Example Three saves of the same spec get merged into one.
Pattern discoveryMemory Analysis
Analyzes accumulated memory and reports trends like recurring mistakes or topics the team often debates. Use it as a hint for retrospectives and improvement.
Example Surfaces that 'auth-error memories are increasing.'
Send only what's neededCompile
When handing memory to the AI, instead of sending everything it auto-selects only what relates to the current task and keeps it compact. Less noise means better answers and lower cost.
Example From 1,000 memories, it extracts just the 10 relevant to your task and sends them to Claude.
Individual knowledge to the teamTeam Sharing
Automatically converts what you built up with AI into a form the whole team can use. No more 'only they know it', and work is less likely to stall when someone's away.
Example One person's research notes are shared into the team's memory base automatically.
Links to sourcesSource Ref
Automatically records links to the sources behind a memory, like GitHub Issues and docs. You can verify the rationale later, and the risk of the AI producing wrong info drops.
Example 'This design is based on the discussion in Issue #42' — linked automatically.
Whole-memory reviewBroadlistening
As memory grows, it steps back and re-evaluates 'memories whose importance changed' and 'topics with more connections'. It maintains quality periodically so search accuracy holds even as volume grows.
Example Finds that a 3-month-old memory relates to your current project.
COST BENEFITS
Lower cost while memory becomes a usable knowledge asset
All figures below are simulations. Actual savings vary by project size, usage, and the AI model you use. Please treat them as pre-adoption reference only.
All figures below are simulations, not measured results.
Token cost reduction
Stop sending everything — pass only relevant memory
~72% fewer tokens in our own estimate (simulated)
Stop sending everything — cut wasted tokens
Sends only relevant memory instead of full context, cutting tokens by ~72% while keeping your AI precisely informed.
Assumptions & method
Full-context: ~25,000 tokens / call (assumption)
Compile Layer: ~7,000 tokens / call (assumption)
At $3 / 1M input tokens → ~72% fewer tokens per call
Kagura AI's own simulation (not measured)
Maintained accuracy
Avoids long-context decay
based on NoLiMa (ICML 2025) and related benchmarks
Less clutter means knowledge your AI can actually use
Accuracy drops as context grows. Delivering only relevant memory avoids the degradation confirmed across the latest generation of models.
Assumptions & method
NoLiMa (ICML 2025 / arXiv:2502.05167): at 32K tokens, many models fall below 50% of their short-context baseline
Later-generation large models also show long-context accuracy degradation in benchmarks
Kagura AI's own simulation (not measured)
Compounding knowledge
Builds up as you use it
team / ongoing-use estimate
Less re-explaining means knowledge compounds into an asset
The more you log decisions and work, the less you re-explain. Personal memory compounds into a shared team asset over time.
Assumptions & method
Assume ~3,000 tokens per re-explanation, computed at $3 / 1M input tokens
Cutting 20 re-explanations/day per person → ~$5.4/month direct savings (scales with team size)
Plus reduced time spent explaining (varies by frequency and scale)
Axis
Without Memory Layer (full send)
Kagura AI (Compile Layer)
Difference (simulated)
Tokens per call
~25,000+
~7,000 or fewer
~72% less
Monthly API (1 person, 100/day)
~$162/mo
~$63/mo
~$99 saved
Cost at 3× scale
~3× increase
Growth suppressed
Lower growth rate
New session startup
Explain from scratch
Starts with memory
No explanation cost
All of the above are Kagura AI's own simulations and do not guarantee actual results. While modern AI context windows have expanded to 200K–1M tokens or more, academic research consistently shows that feeding long context directly into models reduces accuracy. NoLiMa (ICML 2025) found that at 32K tokens, many models fall below 50% of their short-context baseline, and long-context accuracy degradation has also been reported for later-generation large models. Monthly cost estimates use a representative input price of $3 / 1M tokens; actual rates vary by model. Actual savings vary widely by project size, call frequency, and the nature of the context. We recommend validating in your own environment before adoption.
Primary source: Modarressi et al. (2025), 'NoLiMa: Long-Context Evaluation Beyond Literal Matching' (arXiv:2502.05167 / ICML 2025) / token rates per AI model providers' published pricing
PRICING
Find the right way to start
Begin with the invite-only trial. Scale to Starter or Pro as your team grows. Custom and large-scale needs are covered by Enterprise.
Kagura AI is a memory platform for AI agents. It's a shared place where AIs like Claude, ChatGPT, Cursor, and Gemini remember your team's work logs, decisions, and state — storing and organizing memory so any tool can recall the same context and pick up where you left off.
Yes. Even though it's a memory platform, the basics work like chatting: tell Claude or ChatGPT 'remember this' to save, and ask 'what did we decide about that?' to recall. MCP and API connections are available too, but we recommend starting with the trial plan from the screen first.
ChatGPT's memory lives only inside ChatGPT. Kagura AI is an independent memory platform that doesn't belong to any one tool, so multiple AI agents — Claude, Cursor, Gemini, Codex — share the same memory. It also adds team sharing, per-project separation, and reference history.
RAG retrieves documents from a large set and hands them to the AI. As a memory platform, Kagura AI structures not just documents but the state and history of when, who, and how something was decided. It can complement RAG.
Data on the platform is private to you by default. Only memory you set to team-shared becomes accessible to permitted members. For enterprise use, you can self-host the platform on your own servers.
Yes. We currently offer an invite-only trial plan for free — up to 1,000 memories and 1 workspace — to try the platform's core features. Just reach out via 'Request an invite'.
/compact summarizes the current conversation into a file. Kagura AI further structures that summary and stores it on the platform as memory you can search and reuse from future sessions and other AI tools.
It depends on the platform pricing plan. Starter: 2 workspaces, up to 5 trial invites. Pro: 5 workspaces, up to 20 trial invites. For larger scale or company-wide rollout, talk to us about Enterprise. Pricing is being finalized and will be published on this page.
Yes. A Source Ref feature links memory to GitHub Issues and PRs, and other tools like Slack connect via API/MCP. With the platform at the center, context scattered across tools is consolidated in one place.
Reach out to contact@kagura-ai.com via 'Request an invite'. If you want to try it solo, we'll guide you to the trial plan. If you're considering team or enterprise adoption, contact us via the 'Business inquiries' form.