Sup AI

AI ensemble that scored #1 on Humanity's Last Exam

148 followers

AI ensemble that scored #1 on Humanity's Last Exam

148 followers

Visit website

AI Chatbots

Every LLM hallucinates. They just don't hallucinate the same things. Sup AI runs multiple LLMs (out of 339) in parallel, then synthesizes answers by measuring confidence on every segment. High entropy = likely hallucination, downweighted. Low entropy = likely accurate, amplified. Result: 52.15% on Humanity's Last Exam, 7.41 points ahead of any individual model. $10 starter credit. Card verified. No auto-charge.

Overview
Reviews
Alternatives
Built with
Team
More

Sup AI makers

Here are the founders, developers, designers and product people who worked on Sup AI

Scott Mueller AI Research Scientist

Sup AI

Ken Mueller CEO at Sup AI · Stanford

Sup AI