AI systems rarely break overnight; they decay. They fade, shift, and degrade quietly. Stanford researchers found GPT-4's accuracy on a basic reasoning task dropped from 97.6% to 2.4% between March and June:
https://arxiv.org/abs/2307.09009

variA/Bly has evaluated 10+ workflows, and the same pattern appears: accuracy drifts (often 15-40%), prompts regress, RAG relevance drops, and costs fluctuate (20-50%).

The truth: AI systems are inherently nondeterministic, so some drift is natural. The real business risk is that most business owners aren't measuring it.

We recently launched a 30-day "AI Drift & Accuracy Pilot" to help teams see how their workflows change week to week. If you want your drift map, happy to share.
There is now a way to measure your AI drift. variA/Bly helps you evaluate and A/B/n test prompts scientifically, so you catch issues before users complain.
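The week-to-week measurement idea is simple in principle. Here is a minimal sketch, assuming you re-run a fixed evaluation set every week and log per-item pass/fail results; the function names (`accuracy`, `drift`) and the data are illustrative, not variA/Bly's actual API.

```python
# Minimal sketch of week-over-week drift measurement on a fixed eval set.
# Names and numbers are illustrative assumptions, not variA/Bly's API.

def accuracy(results: list[bool]) -> float:
    """Fraction of eval items the workflow answered correctly."""
    return sum(results) / len(results)

def drift(prev: list[bool], curr: list[bool]) -> float:
    """Relative change in accuracy between two weekly runs."""
    prev_acc, curr_acc = accuracy(prev), accuracy(curr)
    return (curr_acc - prev_acc) / prev_acc

# Example: the same 10-item eval set run in week 1 and week 2.
week1 = [True] * 9 + [False]       # 90% accuracy
week2 = [True] * 7 + [False] * 3   # 70% accuracy
print(f"drift: {drift(week1, week2):+.1%}")   # drift: -22.2%
```

The key discipline is holding the eval set constant: only then does a change in the score reflect drift in the system rather than drift in the test.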
Differentiators:
• 41-dimensional evaluation - quality scored across multiple dimensions
• Statistical A/B testing - confidence intervals, not gut feeling
• AI-powered optimization - generates better prompts from data
• Prompt Registry - version control and deployment
Other tools wait for user complaints. variA/Bly measures continuously.
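"Confidence intervals, not gut feeling" can be sketched with a standard two-proportion interval on the accuracy difference between two prompts. This illustrates the general statistical technique only; it is not variA/Bly's implementation, and the eval counts are made up.

```python
# Hedged sketch: 95% confidence interval for the difference in pass
# rates between prompt A and prompt B (two-proportion z-interval).
# Counts below are hypothetical; this is not variA/Bly's code.
import math

def diff_confidence_interval(wins_a, n_a, wins_b, n_b, z=1.96):
    """Approximate CI for p_b - p_a; z=1.96 gives a 95% interval."""
    p_a, p_b = wins_a / n_a, wins_b / n_b
    se = math.sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    diff = p_b - p_a
    return diff - z * se, diff + z * se

# Suppose prompt A passed 70/100 evals and prompt B passed 85/100.
lo, hi = diff_confidence_interval(70, 100, 85, 100)
print(f"B - A: [{lo:+.3f}, {hi:+.3f}]")
# If the whole interval sits above 0, B's improvement is statistically
# meaningful rather than a gut-feeling win on a handful of samples.
```

The design point is that a single accuracy number hides sampling noise; an interval tells you whether the gap between prompts could plausibly be zero.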