Evidently is an open-source framework to evaluate, test and monitor AI-powered apps.
π 100+ built-in checks, from classification to RAG.
π¦ Both offline evals and live monitoring.
π Easily add custom metrics and LLM judges.
Replies
Best
Elena, this is an impressive step for Evidently! Expanding into LLM evaluation is so needed in today's landscape. With the variety of built-in checks and the flexibility to add custom metrics, it really simplifies a complex area. The challenges you outlined are so relatableβhaving a structured approach will be a game-changer for many developers. Excited to see how the community will contribute to this project! Keep pushing those boundaries! π
Congrats with the launch! Great milestone @elenasamuylova and @emeli_dral! Evidenlty is part of my MLOps stack and I recommend it to my friends and clients! I'm happy to contribute to Evidently and look forward to collaboration!
@aadral Thanks for your support! Looking forward to LLM-related feature requests π
Report
Congrats on the launch, @elenasamuylova! π It's amazing to see Evidently evolving into the realm of LLMs with such robust features. The focus on a quality workflow is crucial for us as we develop AI-powered applications. I love the idea of easily integrating custom metrics and having that interactive summary for evaluations. Looking forward to exploring the new capabilities and contributing to the community! Keep up the great work!
@bunga_trisnulia We make it easy for the user to focus only on the evaluation's contents (for example, write that "I want to label responses as concise or verbose") without thinking about how to write the rest of the evaluation prompt. We automatically add all the other parts, such as formatting prompts as JSON to get structured output, asking LLM to provide the reasoning before outputting the label, etc.
Basically, we help the users to define only what's strictly necessary but do all the boilerplate on the background.
Report
Congrats on the launch, Elena and Emeli! It's nice to see it released in open source!
@eugene_ter_avakyan Thank you! We are looking forward to the community input π
Report
Congrats on this launch, Elena! The transition from traditional ML to LLMs is a game changer. The ability to customize metrics and have a monitoring dashboard will definitely help many makers in evaluating their AI apps. Can't wait to see how the community uses Evidently! π
Hey @elenasamuylova! Exciting stuff with Evidently stepping into the LLM space! The challenges you've outlined around evaluating generative AI are real. Love the
Replies
Evidently AI
Evidently AI
DVC
Evidently AI
Evidently AI
Evidently AI
Heyday Health
Evidently AI
Evidently AI
Evidently AI
Evidently AI
Elisi : AI-powered Goal Management App
Evidently AI