Kevin William David

Deepchecks LLM Evaluation - Validate, monitor, and safeguard LLM-based apps

byβ€’
Continuously validate LLM-based applications including LLM hallucinations, performance metrics, and potential pitfalls throughout the entire lifecycle from pre-deployment and internal experimentation to production.πŸš€

Add a comment

Replies

Best
Poly Das
Very Excited for the launch
Ebi Sotarede
Awesome and great
Sinan
I've been experimenting with LLM evaluation metrics on my own for a while now. This is a pretty good solution, will definitely try it out. How do you imagine the future of CI/CD for LLM applications?
philip tannor
@sakameister great question, this has been a question for testing classic ML as well. I can imagine a process kind of like GitHub Actions that runs suites of tests, and some of them may need to involve making sure some manual annotations happened
Roni Ahmed
Amazing project
Andres Olarte
Interesting
Prateek
Congrats team Deepchecks LLM Evaluation on your launch.
hamo Gaber
very good
Kumar De silva
Good
Antura Pratihar
This is excellent
Saroj
@shirch : Congrats on the launch team, the product looks amazing.
First
Previous
β€’β€’β€’
3456
Next
Last