You're mid-task. Claude is in flow. Then the plan limit hits and everything stops. You know the feeling — the session cuts out, the context is gone, and you're starting over. For heavy Claude Code users, this isn't an occasional annoyance. It's a regular ceiling on what you can get done in a day. We built Edgee's Claude Code Compressor to push that ceiling back.

Edgee Claude Code Compressor gallery image

Free

Launch tags:Software Engineering•Developer Tools

Launch Team

AppSignal — Get full visibility into app health, errors, and performance

Get full visibility into app health, errors, and performance

Promoted

Edgee

Maker

📌

❤️ Today, we're launching the @Edgee Claude Code Compressor.
I want to show you what it does with a real-world test scenario, so I recorded this video.

I created two separate Claude Code sessions, each connected to a dedicated plan. Same codebase, same task, same instructions: one side standard Claude Code; the other routed through Edgee with compression enabled.

Left side stops at 21 instructions. Right side reaches 26.5.

+26.5% more session before hitting your plan limit.

Here's how it works: Edgee sits between Claude Code and the Anthropic API. Before each request is sent, it strips redundant context, deduplicates instructions, and sends a leaner prompt. Claude sees less noise. You get more range.

To install: curl -fsSL https://install.edgee.ai | bash

Then: edgee launch claude

That's it. Free. Takes 30 seconds to set up.

If you're a Claude Code user who's hit the plan wall mid-task, this is for you. If you're running Claude on Anthropic's API and watching your token bill grow, this is also for you.

We've been in beta for a few weeks. Today it's out for everyone.

Report

6d ago

Hunter

neat product - keep up the great work, @sachamorard and team 👏👏

Report

5d ago

Edgee

Maker

You rock @fmerian ! Thank you very much for supporting and highlighting this incredible feature.

Report

3d ago

Cipherwill

@sachamorard does code quality declines??

Report

3d ago

Edgee

Maker

@shivam1337 Absolutely not, on the contrary. Compression for Claude Code applies to tool results that are very often too verbose. For example, when the model asks your Claude Code to execute a git log, the model doesn't need unnecessary details. Our compressor cleans up all the polluting elements.

Report

3d ago

Congrats. A very clever solution to a black-box problem. I’d be interested in learning more about your business model. Will your service offer a paid plan? That would mitigate the impact of the AI provider’s pricing. Or perhaps you monetize the data, since you act as a middleman, which would make it harder for me to choose a solution like this..

Report

3d ago

Edgee

Maker

@barnabed we do not monetize the data, because we do not store the prompts ! Never, ever ! We offer other services for enterprises, like a compressor for agentic use cases, multi LLM, edge tools, caching…

Report

3d ago

@sachamorard thanks for your answer 👏👏👏

Report

3d ago

Would be great to see a breakdown or visualization of what’s being removed vs kept. That could help build trust in the compression layer.

Report

3d ago

Edgee

Maker

@nikita_jain18 you’re right. When you finish a Claude session with Edgee, you can access to a dashboard that shows the savings. And if you activate the debug mode, you also have access to the detail of what we optimized.

Report

3d ago

Hit that Claude limit mid-flow way too many times 😅 this kind of compression feels like a simple fix that actually saves real time + money.

Report

2d ago

Edgee

Maker

@allinonetools_net Simple and efficient. Just a simple CLI install (with brew or culr), then `edgee launch claude`... and that's it, you save up to 50% of token cost :)

Report

2d ago

Interesting that the fallback sends the original prompt when BERT score is too low. Smart safety net. One thing I'd watch though: Claude Code already runs its own context compression internally, and there are known issues where that causes it to lose track of CLAUDE.md instructions. Adding another compression layer on top might amplify that. Have you tested how the two interact?

Report

2d ago

Using Edgee already, really great product.

Super simple idea but actually makes a difference on costs

Report

3d ago

Edgee

Maker

@thierry_abalea We are very proud to have your support, especially coming from an entrepreneur like you who is achieving great things.

Report

3d ago

More tokens, fewer plan interruptions 🙌

Report

3d ago

Edgee

Maker

@maxwell_timothy Thanks a lot. Don't hesitate to try it, it's 100% free

Report

3d ago

1 2

Previous Edgee Launches

EdgeeThe AI Gateway that TL;DR tokens

Launched on February 12th, 2026

Forum Threads

p/edgee

•

2mo ago

Token Compression for LLMs: How to reduce context size without losing accuracy

Hey, I'm Sacha, co-founder at @Edgee

Over the last few months, we've been working on a problem we kept seeing in production AI systems:

LLM costs don't scale linearly with usage, they scale with context.
As teams add RAG, tool calls, long chat histories, memory, and guardrails, prompts become huge and token spend quickly becomes the main bottleneck.

So we built a token compression layer designed to run before inference.

View all