Compare

Claude Sonnet 5 vs Gemini 3.1 Pro

Claude Sonnet 5 (Anthropic) and Gemini 3.1 Pro (Google DeepMind) compared on benchmarks, pricing, context window, and use-case rankings.

Claude Sonnet 5 ranks higher for coding (94th vs 68th percentile).

Claude Sonnet 5Anthropic · 3 in the newsGemini 3.1 ProGoogle DeepMind · 3 in the news

Scores

Intelligence (ECI)—155

Coding9468

Math—77

Reasoning & Knowledge8896

Agentic & Tools—88

Specifications

DeveloperAnthropicGoogle DeepMind

FamilyClaudeGemini

ReleasedJun 30, 2026Feb 19, 2026

Parameters——

Availability—API access

Context window1M—

Price — $/M input$2.00—

Price — $/M output$10.00—

Inputstext, image, file—

Outputstext—

Benchmarks

GPQA Diamond91%94%

Humanity's Last Exam40%46%

SciCode54%—

AIME 2024/2025—96%

APEX—34%

ARC-AGI—98%

ARC-AGI-2—77%

FrontierMath—37%

FrontierMath Tier 4—17%

GSO (code optimization)—23%

METR task horizon—6.4 h

SimpleBench—80%

SimpleQA Verified—77%

SWE-bench Verified—76%

Terminal-Bench—80%

WebDev Arena—1461

WeirdML—72%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Which is better for coding, Claude Sonnet 5 or Gemini 3.1 Pro?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Claude Sonnet 5 ranks higher — 94th vs 68th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

More comparisons

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.