fmerian

Next.js Evals - Performance results of AI coding agents on Next.js

Performance results of AI coding agents on Next.js code generation and migration tasks, measuring success rate and execution time.

Add a comment

Replies

Best
fmerian

In a recent thread, we debated what the best AI coding models are, and some community members rightly pointed out that, well, it depends.

@Next.js recently released their benchmark, updated daily, and currently, @OpenAI's GPT 5.3 Codex (xhigh) is achieving 90% on evals out of the box.

Now we know! Until the next model release.

View source code on GitHub