Spec-driven testing for AI agents and AI apps

What kind of agent validation are you doing today?

Spec27

•4d ago

It all started with model evals and benchmarks (which model is better?), then moved on to prompt management, and from there to trace analysis. What are people doing today, and how are they sourcing test datasets?
