Launched this week

TokenZip
Open protocol for AI agents to share memory, not tokens
38 followers
TokenZip Protocol reduces AI-to-AI communication bandwidth by 80% and latency by 95%. Open standard for heterogeneous agents. Try the live demo.
Test API Base URL: https://tokenzip.org
Auth: Authorization: Bearer demo-investor-key
How to call it: see the following comment.
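A minimal sketch of calling the demo gateway with the bearer key above. Only the base URL and header come from the post; the `/v1/chat/completions` path and JSON payload shape are assumptions (mirroring an OpenAI-compatible layout), and the request is built but not sent:

```python
import requests

# Base URL and bearer key are from the post above; the endpoint path
# and payload shape are assumptions for illustration.
BASE_URL = "https://tokenzip.org"
HEADERS = {"Authorization": "Bearer demo-investor-key"}

req = requests.Request(
    "POST",
    f"{BASE_URL}/v1/chat/completions",  # assumed OpenAI-compatible path
    headers=HEADERS,
    json={"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "ping"}]},
)
prepared = req.prepare()  # build without sending
print(prepared.url)
print(prepared.headers["Authorization"])  # Bearer demo-investor-key
```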

Prava
Interesting. Is it better than RAG-based systems? I mean, even there you can plug the same memory across multiple agents. Any specific reason you chose this approach?
@shubham_kukreti When you use RAG, the retrieved chunks change slightly with every query. That constantly modifies the prompt prefix, which breaks OpenAI's and Anthropic's native prompt caching: your cache hit rate drops to zero.
TokenZip does the exact opposite. Because we restore the exact same massive text block via the pointer, we force a 100% cache hit rate on the LLM provider's side. RAG makes API bills higher; TokenZip weaponizes native caching to drop bills by 90%.
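The caching argument can be simulated in a few lines. This is a toy model, not the providers' actual cache: it keys a pretend cache on the prompt prefix, the way real prefix caching keys on exact leading tokens.

```python
import hashlib

cache = set()  # toy stand-in for a provider-side prefix cache

def is_cache_hit(prompt: str, prefix_len: int = 64) -> bool:
    """Key the simulated cache on the prompt's leading characters."""
    key = hashlib.sha256(prompt[:prefix_len].encode()).hexdigest()
    hit = key in cache
    cache.add(key)
    return hit

# RAG-style: retrieved chunks differ per query, so the prefix keeps changing.
rag_hits = [is_cache_hit(f"[chunk-{i}] retrieved context, answer the question")
            for i in range(3)]

# TokenZip-style: the pointer restores the exact same block every time.
stable = "[trex:abc123] restored context, answer the question"
zip_hits = [is_cache_hit(stable) for _ in range(3)]

print(rag_hits)  # [False, False, False] -> every call misses
print(zip_hits)  # [False, True, True]  -> warm after the first call
```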
To use RAG across agents, a developer needs to spin up Pinecone, pick an embedding model, write chunking logic, and manage vector states. It takes a week.
To use TokenZip, they change api.openai.com to gateway.trexapi.com. It takes 15 seconds. We are building a network layer, not a database.
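If the gateway is OpenAI-compatible, the switch really is just the base URL. A sketch, assuming the gateway mirrors OpenAI's `/v1` path layout (the host is from the comment above; the suffix is an assumption):

```python
# Switching from OpenAI direct to the Trex gateway is a one-line base-URL change.
OPENAI_DIRECT = "https://api.openai.com/v1"
TREX_GATEWAY = "https://gateway.trexapi.com/v1"  # assumed "/v1" suffix

def base_url(use_trex: bool = True) -> str:
    """Pick the API host; everything else in the client config stays the same."""
    return TREX_GATEWAY if use_trex else OPENAI_DIRECT

# e.g. with an OpenAI-compatible SDK (not executed here):
#   client = OpenAI(api_key=..., base_url=base_url(True))
print(base_url(True))
```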
Prava
@tokenzip understood
How does it actually manage to shrink all that AI memory into such tiny pointers?
Super curious to try it out.
@lucas_turner2 It doesn’t literally pack the whole memory into the pointer itself. The pointer is just a TrexID.
What happens is:
we extract the task-relevant context
preserve high-risk details exactly, like code, IDs, numbers, and structured fields
compress the low-signal narrative parts
store that managed context behind a TrexID
So the “tiny pointer” is really a reference to a processed context object, not the raw memory itself.
That’s why later requests can pass a short TrexID instead of resending the whole history.
Happy to get you early access if you want to try it.
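The steps above can be sketched as a toy pipeline. Entirely illustrative: the real extraction and compression logic isn't public, so the regex and the truncation below are stand-ins, and the store is an in-memory dict.

```python
import re
import uuid

STORE: dict[str, dict] = {}  # toy in-memory store behind TrexIDs

def store_context(raw: str) -> str:
    """Toy pipeline: keep high-risk details exactly, compress the
    narrative, and park the result behind a TrexID."""
    # 1. Extract high-risk details verbatim (IDs and numbers; stand-in regex).
    exact = re.findall(r"\b[A-Z]+-\d+\b|\b\d+(?:\.\d+)?\b", raw)
    # 2. "Compress" the low-signal narrative (stand-in: truncate).
    summary = raw[:80]
    # 3. Store the processed context object behind a fresh TrexID.
    trex_id = f"trex-{uuid.uuid4().hex[:8]}"
    STORE[trex_id] = {"exact": exact, "summary": summary}
    return trex_id

def load_context(trex_id: str) -> dict:
    """Later requests pass the short TrexID instead of the full history."""
    return STORE[trex_id]

tid = store_context("Order ORD-4412 failed after 3 retries; user asked for a refund")
print(tid)
print(load_context(tid)["exact"])  # ['ORD-4412', '3']
```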
This is exactly what I need. Been wasting so much bandwidth passing huge tokens between my bots.
@oliver_hayes1 That’s exactly the problem we’re solving.
Instead of passing huge token payloads between bots every step, Trex lets them reference shared context through a TrexID. Much less duplication, much lower bandwidth, much cleaner handoff.
Happy to get you early access if you want to try it.
Would love a simple visual of how agents share memory. Might make it easier to onboard new users.
@daniel_brooks8 Totally agree. A simple visual would help a lot.
The core idea is just:
agent creates context -> Trex stores it behind a TrexID -> other agents reuse that TrexID instead of passing full memory around.
Much easier to understand in a diagram than in text. We’re putting one together.
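In the meantime, the three-step flow fits in a few lines of code. Toy in-memory store, illustrative only, with made-up payloads:

```python
import uuid

TREX_STORE: dict[str, str] = {}  # stand-in for the Trex service

def publish(context: str) -> str:
    """Step 1+2: agent creates context, Trex stores it behind a TrexID."""
    trex_id = f"trex-{uuid.uuid4().hex[:8]}"
    TREX_STORE[trex_id] = context
    return trex_id

def resolve(trex_id: str) -> str:
    """Step 3: other agents reuse the TrexID instead of the full memory."""
    return TREX_STORE[trex_id]

history = "step 1: scraped site\nstep 2: summarized findings\n" * 50  # big payload
pointer = publish(history)       # Agent A hands only the short ID to Agent B
restored = resolve(pointer)      # Agent B recovers the exact context
print(len(pointer), "chars on the wire vs", len(history), "chars of memory")
```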