Steven Willmott's profile on Product Hunt

All activity

21m ago

Spec27 is a validation platform for AI agents. It helps teams move beyond manual, vibes-based testing by using machine-readable specifications to generate broader test coverage, catch regressions earlier, and validate both in-house and third-party systems without needing SDK integration or code-level access.

Spec27Spec-driven testing for AI agents and AI apps

Steven Willmottstarted a discussion

12h ago

What kind of Agent validation are you doing today?

Everything started with model Evals and benchmarks (which model is better?), then evolved to prompt management and from there to analyzing traces. What do people do today, and how are they sourcing test datasets?

Steven Willmottleft a comment

13h ago

Hello Product Hunt, excited to be here! For the past three years, we’ve been working on formal verification of machine learning models and looking for ways to get deep, relevant test coverage. With language-model-based applications, this is particularly hard since the models and input spaces are massive, plus you often don’t have access to the underlying model. So the techniques from formal...

Spec27Spec-driven testing for AI agents and AI apps