Waqas Ahmad

Waqas Ahmad

The QA expert for AI-built websites

About

QA Engineer with 8+ years of experience helping teams ship reliable, user-friendly products. I’ve tested 1500+ websites and apps, focusing on usability, UX, and real user flows. Currently building an AI-powered website audit tool designed to go beyond basic checks—testing like a real QA engineer and delivering actionable insights on usability, functionality, bugs, responsiveness, and UI/UX.

Badges

Gemologist
Gemologist
Tastemaker
Tastemaker
Gone streaking
Gone streaking

Forums

speedy_devv

2d ago

Anyone else running Opus 4.7 yet? This one feels different (with CC harness)

Anthropic just shipped Opus 4.7 today and i had to write about it somewhere because the jump is weird.

I ran the same backlog task on 4.6 and 4.7 back to back. same repo, same prompt, same tools. 4.6 looped on a bug for 25 minutes and was not going to solve it. 4.7 closed it in eleven, and the part that freaked me out is that it paused in the middle to sanity-check an assumption i had not asked it to check. literally wrote "before i write this migration, let me verify the actual shape of the response object, because my assumption here might be wrong" and then went and verified it. unprompted.

That self-verification behavior is the thing. Vercel is reporting it does proofs on systems code before starting work. Hex says it flags missing data instead of making up plausible-but-wrong fallbacks. Genspark measured loop rates on hard queries and 4.7 basically stopped looping. different teams, different harnesses, same pattern.

the numbers are nuts too:

View more