We re trying something new on Thursday: Alpha Day.
The idea is simple. If this is the first time you re launching your product anywhere, you can tag it alpha and get a boost to your points (and land on a special leaderboard).
Launched last week, open-source frontier model @MiniMax M2.7 scores 56.2% on SWE Bench Pro, converging towards the best proprietary models like @Claude by Anthropic Opus 4.6.