Langfuse is the #1 open source LLM engineering platform. It helps teams iterate on and improve their LLM applications: LLM tracing, metrics, llm-as-a-judge, evaluations, prompt management, datasets testing and more. Create a free account or self-host Langfuse.
Hi Product Hunt 👋👋👋
I’m Clemens, co-founder of Langfuse. We are so excited to be live on Product Hunt again today, launching the biggest feature yet: Langfuse Prompt Experiments!
This new feature is designed to close the loop of developing and improving LLM applications.
With Prompt Experiments, you can test and evaluate different prompt versions and LLMs on hundreds of examples simultaneously; Perform live LLM-as-a-judge evaluations and compare results in our new Dataset Comparison View to optimize prompts for your specific use case.
You can quickly make prompt changes, see whether they improve metrics without causing new issues, and then deploy to production with confidence. Move faster on your LLM Application development than ever before.
A few things you should know about Langfuse:
👣 Tracing: TS & Python & API + OpenAI, Llama Index, LangChain, LiteLLM + many more
✏️ Prompt Management: Collaboratively manage and deploy prompts from within Langfuse
⚖️ Evaluation: Automatically run evals to control for hallucinations or other criteria you care about
💾 Datasets: Collaboratively manage fine-tuning, testing and golden datasets
🧪 Dataset Experiments: Test different prompts and models on hundreds of traces simultaneously
📊 Metrics: Dashboards and analytics on cost, latency and quality
🕹️ LLM Playground: Engineer your prompts and directly see results
🏎️ Export & Fine Tune: Open GET API and csv/JSON exports to build downstream use cases
🚄 Scale: We’ve invested significantly in scaling and resilience as we’ve scaled to thousands of users and handle many many millions of events a day
🧑🤝🧑Community: Join our thousands of users on GitHub Discussions and Discord
See for yourself:
⭐ GitHub: https://github.com/langfuse/lang...
⏯️ Interactive Demo: https://langfuse.com/demo
📒 Docs: https://langfuse.com/docs
💬 Discord: https://langfuse.com/discord
Thanks again, PH community. This is where we started, and we are incredibly grateful for all of the community feedback which led to the development of this major new feature.
We’re excited and will be in the comments the whole day to hear your thoughts & feedback!
Congratulations on the launch!! 🚀We have been using Langfuse from day one and it helped us tremendously in getting deep visibility into the performance of our AI stack!
thanks @marc_metz, prompt and mode changes can be very risky. Glad that we can help mitigate some of the risk with langfuse now via strong evals and tests -- without needing to use our SDKs
Report
Langfuse Prompt Experiments is a powerful tool for optimizing and managing LLM applications! The ability to iterate and test different prompt versions, while ensuring efficiency and reliability, will certainly speed up development cycles. The integration of prompt management, evaluation, and dataset comparison is a game-changer for teams working with LLMs.
How do you handle edge cases in prompt testing to ensure your models are robust across various scenarios?
> How do you handle edge cases in prompt testing to ensure your models are robust across various scenarios?
@hritik_choudhary, this mainly depends on the datasets used for the experiment. The closer they resemble production, the more robust the test will be.
🚀 Congrats, this is BIG! 🚀
Very excited to see how far you have come in just one year! And so excited to see how much further you will go from here.
✨ Very exciting product and a great team! ✨
⭐️⭐️⭐️⭐️⭐️ Recommended!
Congrats team on the producthunt launch and the whole launch week! Really big fan of the new features around experimenting with prompts and the new eval features! Glad to have not to do any more vibe-based evaluation!
@dan_meier1 this is the main goal, move to more structured evaluation which helps to confidently make decision - without having to set up a whole testing suite and thus making this more accessible to everyone involved in prompt engineering
Replies
Langfuse
Langfuse
Langdock
Langfuse
Langdock
Langfuse
WizFairy
Langfuse
Langfuse
AccessOwl
Langfuse
Langfuse
Learn Animation Online - Resources
Langfuse
Yapify
Langfuse