Riyad Sarsour

Dutchman Labs - Eval Studio - Test Your Agents Faster

by
Speed up your testing and agent validation

Add a comment

Replies

Best
Mykola Kondratiuk

curious how the test runner handles non-determinism. agents give different outputs on identical inputs - that is not a bug, but it breaks most eval frameworks expecting stable assertions.