
Agenta
Open-source prompt management & evals for AI teams
576 followers
Agenta is an open-source LLMOps platform for building reliable AI apps. Manage prompts, run evaluations, and debug traces. We help developers and domain experts collaborate to ship LLM applications faster and with confidence.

Agenta
Hi Product Hunt 👋
I'm Mahmoud, co-founder of Agenta. The team and I are excited to launch Agenta today.
What is Agenta?
Agenta is an open-source platform that helps AI teams ship reliable LLM applications.
The Problem
Building a demo is easy. Building a reliable app is hard.
Small prompt changes improve one case but break another
Subject matter experts and engineers can't collaborate easily (prompts end up scattered across code and spreadsheets)
Teams don't know if their prompts are working in production
How Agenta Solves This
Playground for the whole team. Everyone can experiment with prompts and models, not just engineers.
Deploy without code changes. Anyone can push a working prompt instantly.
Test before you ship. Create test cases and validate prompts against them (no more vibe-based prompting).
Monitor in production. Track mistakes, user feedback and costs after deployment.
Who's Using Agenta
Hundreds of teams use Agenta Cloud (generous free tier) or self-host it. They run more experiments, ship AI features faster, and collaborate in one place.
Try It Yourself
⭐ GitHub: https://github.com/agenta-ai/agenta
☁️ Cloud (free, no credit card): https://cloud.agenta.ai
📚 Docs: https://agenta.ai/docs
Looking forward to your feedback!
@mabrouk, congrats on the launch! You and the team are building something important here. We are a fellow Antler company building a cloud platform to optimize the DevOps cycle. Feel free to reach out on LinkedIn and let's chat. I think we can have a win-win here. Godspeed!
Agenta
@savian_boroanca Thanks for your kind words! I will reach out!
@mabrouk, looking forward to it! have a fantastic launch day :-)
EasyFrontend
Nice to see a tool that lets both devs and non-tech team members collaborate. Best wishes to the team. One thing I am curious about: how does Agenta handle versioning for prompts and evaluations?
Agenta
@getsiful Thanks for your comment! For prompt versioning, we use a Git-like system where you can create branches. Each branch has its own prompt history, so team members can work on their versions independently and then deploy to production. The cool thing is that when you deploy to an environment like production, you don't need to change any code. It all happens within Agenta, and the agent fetches the prompts directly from there.
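To make the branching idea concrete, here is a minimal in-memory sketch. It is not Agenta's SDK or data model: `PromptRegistry`, `commit`, `deploy`, and `fetch` are hypothetical names chosen for illustration. It only shows the general pattern of per-branch histories plus an environment pointer that the application resolves at runtime.

```python
from dataclasses import dataclass, field


@dataclass
class PromptRegistry:
    """Toy model of branch-based prompt versioning (hypothetical, not Agenta's API)."""

    # branch name -> list of prompt texts, oldest first
    branches: dict[str, list[str]] = field(default_factory=dict)
    # environment name -> (branch, version index) currently deployed
    environments: dict[str, tuple[str, int]] = field(default_factory=dict)

    def commit(self, branch: str, prompt: str) -> int:
        """Append a new version to a branch and return its version number."""
        history = self.branches.setdefault(branch, [])
        history.append(prompt)
        return len(history) - 1

    def deploy(self, environment: str, branch: str, version: int) -> None:
        """Point an environment at one specific version of a branch."""
        if version < 0 or version >= len(self.branches.get(branch, [])):
            raise ValueError(f"unknown version {version} on branch {branch!r}")
        self.environments[environment] = (branch, version)

    def fetch(self, environment: str) -> str:
        """What the application calls at runtime to get the deployed prompt."""
        branch, version = self.environments[environment]
        return self.branches[branch][version]


registry = PromptRegistry()
v0 = registry.commit("main", "Summarize the text.")
registry.deploy("production", "main", v0)

# A teammate experiments on a separate branch; production is unaffected.
v_alice = registry.commit(
    "alice-experiment", "Summarize the text in three bullet points."
)
before = registry.fetch("production")

# Deploying only moves the environment pointer; application code is unchanged.
registry.deploy("production", "alice-experiment", v_alice)
after = registry.fetch("production")
```

Because the application only ever calls `fetch("production")`, switching the deployed prompt requires no code change on the application side, which is the property described above.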
For evaluation, you create test sets and define evaluators (the metrics you want to measure). When you run evaluations, they connect directly to your prompts so you can see how changes affect performance.
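As a rough illustration of that evaluation loop, the sketch below runs two stand-in "apps" (representing two prompt versions) over one test set and averages each evaluator's score. Everything here is an assumption for illustration: the function names, the test-set shape, and the two evaluators are hypothetical and do not come from Agenta's SDK, and the apps are lookup tables rather than real LLM calls.

```python
from typing import Callable

# A test set: each case has an input and the expected answer.
test_set = [
    {"input": "2 + 2", "expected": "4"},
    {"input": "3 * 3", "expected": "9"},
]


def exact_match(output: str, expected: str) -> float:
    """Score 1.0 only when the output equals the expected answer."""
    return 1.0 if output.strip() == expected.strip() else 0.0


def contains_expected(output: str, expected: str) -> float:
    """Score 1.0 when the expected answer appears anywhere in the output."""
    return 1.0 if expected in output else 0.0


def run_evaluation(
    app: Callable[[str], str],
    cases: list[dict[str, str]],
    evaluators: dict[str, Callable[[str, str], float]],
) -> dict[str, float]:
    """Run the app on every case and return the mean score per evaluator."""
    totals = {name: 0.0 for name in evaluators}
    for case in cases:
        output = app(case["input"])
        for name, evaluator in evaluators.items():
            totals[name] += evaluator(output, case["expected"])
    return {name: total / len(cases) for name, total in totals.items()}


# Stand-ins for two prompt versions (no real model is called).
ANSWERS = {"2 + 2": "4", "3 * 3": "9"}


def app_terse(question: str) -> str:
    return ANSWERS[question]


def app_verbose(question: str) -> str:
    return f"The answer is {ANSWERS[question]}."


evaluators = {"exact_match": exact_match, "contains_expected": contains_expected}
scores_terse = run_evaluation(app_terse, test_set, evaluators)
scores_verbose = run_evaluation(app_verbose, test_set, evaluators)
```

Comparing `scores_terse` with `scores_verbose` shows how a change in prompt behavior can leave one metric untouched while dropping another, which is the kind of regression a test set is meant to catch.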
Great product! Can I integrate prompts from it into my app via API/SDK?
Can I use variables in the prompt?
Agenta
@pasha_tseluyko Yes to both :)
MailDrop.dev
I recently evaluated Agenta vs Langfuse for Prompt Management and tracing. I went with Langfuse this time but all the best for this project. Open Source FTW.
Something that would really set you apart, that no one else seems to have, would be approval workflows for prompt management. Managing prompts in the UI is great, but in a remotely business-y environment I can't let one person have the ability to push new prompts without checks and balances. We'll probably have to manage this with source control (e.g. GitHub) and write some script to push prompts up to Langfuse once they gain approval.
Agenta
@henricook Thanks for the feedback. We like Langfuse too; we know the team and we're both based in Berlin :)
One differentiator for us is the focus on collaboration between subject matter experts (non-technical) and developers. We're building a workflow that's easy to use from the UI and feature-equal to what you can do from code.
We've discussed approval workflows on the team. Right now we solve this through role-based access control. You can configure Agenta so part of the team works on prompts outside of production (we have a branching system for this, so they can work on their branches), and only certain members (like team leads) can deploy prompts to production.
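A minimal sketch of that kind of role check, assuming a simple two-role setup. The role names, the `DEPLOY_PERMISSIONS` table, and the `deploy` function are hypothetical and only illustrate the idea; they are not Agenta's actual configuration or API.

```python
from enum import Enum


class Role(Enum):
    EDITOR = "editor"        # works on prompts in branches
    TEAM_LEAD = "team_lead"  # may also deploy to production


# Hypothetical policy: which roles may deploy to which environment.
DEPLOY_PERMISSIONS = {
    "staging": {Role.EDITOR, Role.TEAM_LEAD},
    "production": {Role.TEAM_LEAD},
}


def can_deploy(role: Role, environment: str) -> bool:
    """Return True if the role is allowed to deploy to the environment."""
    return role in DEPLOY_PERMISSIONS.get(environment, set())


def deploy(user: str, role: Role, environment: str, prompt_version: str) -> str:
    """Deploy a prompt version, refusing if the role lacks permission."""
    if not can_deploy(role, environment):
        raise PermissionError(
            f"{user} ({role.value}) may not deploy to {environment}"
        )
    return f"{user} deployed {prompt_version} to {environment}"


# An editor can ship to staging but is blocked from production.
staging_result = deploy("alice", Role.EDITOR, "staging", "v3")
try:
    deploy("alice", Role.EDITOR, "production", "v3")
    editor_blocked = False
except PermissionError:
    editor_blocked = True

# A team lead can promote the same version to production.
production_result = deploy("bob", Role.TEAM_LEAD, "production", "v3")
```

This gates who can deploy rather than requiring a second person to approve each change, so it covers part of the checks-and-balances concern raised above, not a full approval workflow.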
Swytchcode
Oh wow, this is really amazing. Collaborating with the team on prompts and debugging with evaluations is a really cool idea. It seems like AI tools are really evolving :) Also, I see APIs, and that makes it even more exciting.
Would love to try that out.
Agenta
@chilarai Thanks for the kind words! Let me know your feedback!
Swytchcode
@mabrouk Absolutely! I'll share detailed feedback as I try things out.
Also, since I’m building Swytchcode (AI-powered API workflow + testing engine), I'd love to explore if there’s room for open collaboration. Agenta’s evaluation and debugging layer feels super complementary to what we’re doing on the API side.
Happy to sync on LinkedIn, if you’re open to it!
Agenta
@chilarai Definitely! We'll reach out!
Agnes AI
It's rare to see an open-source LLMOps project like Agenta! Great launch, and congrats to the team!
Agenta
@cruise_chen Thanks Cruise!
Agenta
Hi everyone! 👋 We built Agenta to give AI teams a way to collaborate on prompts. We offer a complete workflow for building reliable AI apps, from prompt engineering to evaluation and observability. We'd love to hear your thoughts, feedback, or ideas. Thanks for checking us out! 🙌