Claude Sonnet 5 vs Gemini 3.1 Pro
Claude Sonnet 5 (Anthropic) and Gemini 3.1 Pro (Google DeepMind) compared on benchmarks, pricing, context window, and use-case rankings.
- Claude Sonnet 5 ranks higher for coding (94th vs 68th percentile).
Scores
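| Score | Claude Sonnet 5 | Gemini 3.1 Pro |
| --- | --- | --- |
| Intelligence (ECI) | — | 155 |
| Coding | 94 | 68 |
| Math | — | 77 |
| Reasoning & Knowledge | 88 | 96 |
| Agentic & Tools | — | 88 |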
Specifications
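| Specification | Claude Sonnet 5 | Gemini 3.1 Pro |
| --- | --- | --- |
| Developer | Anthropic | Google DeepMind |
| Family | Claude | Gemini |
| Released | Jun 30, 2026 | Feb 19, 2026 |
| Parameters | — | — |
| Availability | — | API access |
| Context window | 1M | — |
| Price — $/M input | $2.00 | — |
| Price — $/M output | $10.00 | — |
| Inputs | text, image, file | — |
| Outputs | text | — |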
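As a rough illustration of what the listed per-million-token prices mean for a single request, here is a minimal sketch. It assumes flat per-token billing at the prices shown for Claude Sonnet 5, with no tiers, caching discounts, or other adjustments. The function name `estimate_cost` and the token counts are hypothetical.

```python
# Listed prices for Claude Sonnet 5 in USD per million tokens (from the specifications).
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 10.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming flat per-token pricing."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
cost = estimate_cost(200_000, 4_000)
print(f"${cost:.2f}")  # prints "$0.44"
```

No price is listed for Gemini 3.1 Pro, so the same calculation cannot be made for it from this page.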
Benchmarks
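| Benchmark | Claude Sonnet 5 | Gemini 3.1 Pro |
| --- | --- | --- |
| GPQA Diamond | 91% | 94% |
| Humanity's Last Exam | 40% | 46% |
| SciCode | 54% | — |
| AIME 2024/2025 | — | 96% |
| APEX | — | 34% |
| ARC-AGI | — | 98% |
| ARC-AGI-2 | — | 77% |
| FrontierMath | — | 37% |
| FrontierMath Tier 4 | — | 17% |
| GSO (code optimization) | — | 23% |
| METR task horizon | — | 6.4 h |
| SimpleBench | — | 80% |
| SimpleQA Verified | — | 77% |
| SWE-bench Verified | — | 76% |
| Terminal-Bench | — | 80% |
| WebDev Arena | — | 1461 |
| WeirdML | — | 72% |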
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
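The page does not say how the composites are calculated. One plausible reading, sketched below purely as an assumption, is to take a model's percentile rank on each benchmark where it has a score and average those ranks. The helper names, model names, and scores are all hypothetical and are not taken from the site's data.

```python
from statistics import mean


def percentile_rank(score, all_scores):
    """Percent of scores in all_scores that are less than or equal to score."""
    return 100.0 * sum(s <= score for s in all_scores) / len(all_scores)


def composite(model, benchmarks):
    """Average a model's percentile rank over the benchmarks it has a score on.

    Returns None when the model has no score on any benchmark.
    This is an assumed aggregation, not the site's documented method.
    """
    ranks = [
        percentile_rank(scores[model], list(scores.values()))
        for scores in benchmarks.values()
        if model in scores
    ]
    return mean(ranks) if ranks else None


# Hypothetical scores for three made-up models on two made-up benchmarks.
benchmarks = {
    "bench_x": {"model_a": 80.0, "model_b": 60.0, "model_c": 40.0},
    "bench_y": {"model_a": 50.0, "model_b": 70.0},
}
print(composite("model_a", benchmarks))  # prints 75.0
```

Under this reading, a model missing from most of an area's benchmarks is ranked only on the few it has, which is worth keeping in mind when a composite rests on a single listed result.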
Frequently asked questions
Which is better for coding, Claude Sonnet 5 or Gemini 3.1 Pro?
Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Claude Sonnet 5 ranks higher — 94th vs 68th percentile among the models tracked on Model Beat.
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.