Kevin William David

Deepchecks LLM Evaluation - Validate, monitor, and safeguard LLM-based apps

Continuously validate LLM-based applications, including hallucinations, performance metrics, and potential pitfalls, throughout the entire lifecycle, from pre-deployment and internal experimentation to production. 🚀

Sinan
I've been experimenting with LLM evaluation metrics on my own for a while now. This is a pretty good solution, will definitely try it out. How do you imagine the future of CI/CD for LLM applications?
philip tannor
@sakameister Great question; this has been an open question for testing classic ML as well. I can imagine a process kind of like GitHub Actions that runs suites of tests, some of which may need to check that certain manual annotations have been made.
Andres Olarte
Interesting
Prateek
Congrats team Deepchecks LLM Evaluation on your launch.
hamo Gaber
very good
Kumar De silva
Good
Antura Pratihar
This is excellent
Saroj
@shirch: Congrats on the launch, team. The product looks amazing.
Md Zaman
Nice product.
Sree Sarkar
This is an amazing product. Great stuff, we are using Deepchecks for our internal LLM evaluation, and it takes only a couple of minutes to get big insights! I really like it.
Sima Khan
Very nice product... I love it