Best AI models for reasoning
Ranked by a 0–100 composite of reasoning and knowledge benchmarks (GPQA Diamond, Humanity's Last Exam, SimpleQA Verified, SimpleBench), scored as percentile rank against every model released in the past year.
Reasoning & Knowledge score is a 0–100 percentile composite across that area’s benchmarks. Open a model for the raw scores.
Frequently asked questions
What is the best AI model for reasoning?
Claude Fable 5 (Anthropic) currently ranks first for reasoning on Model Beat, followed by Gemini 3.1 Pro and GPT-5.3 Codex. Ranked by a 0–100 composite of reasoning and knowledge benchmarks (GPQA Diamond, Humanity's Last Exam, SimpleQA Verified, SimpleBench), scored as percentile rank against every model released in the past year.
Which is the most affordable strong reasoning model?
Among the top-ranked reasoning models, Qwen 3.7 Max is the cheapest at $1.25 per million input tokens.
How are these reasoning rankings calculated?
Ranked by a 0–100 composite of reasoning and knowledge benchmarks (GPQA Diamond, Humanity's Last Exam, SimpleQA Verified, SimpleBench), scored as percentile rank against every model released in the past year.
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.