Claude Opus 4.8 vs Gemini 3.1 Pro

Claude Opus 4.8 (Anthropic) and Gemini 3.1 Pro (Google DeepMind) compared on benchmarks, pricing, context window, and use-case rankings.

Claude Opus 4.8 leads on overall intelligence (ECI 157 vs 156).
Claude Opus 4.8 ranks higher for coding (96th vs 71st percentile).

Scores

                        Claude Opus 4.8   Gemini 3.1 Pro
Intelligence (ECI)      157               156
Coding                  96                71
Math                    91                76
Reasoning & Knowledge   76                96
Agentic & Tools         94                88

Specifications

                        Claude Opus 4.8     Gemini 3.1 Pro
Developer               Anthropic           Google DeepMind
Family                  Claude              Gemini
Released                May 28, 2026        Feb 19, 2026
Parameters              -                   -
Availability            API access          API access
Context window          1M                  -
Price ($/M input)       $5.00               -
Price ($/M output)      $25.00              -
Inputs                  text, image, file   -
Outputs                 text                -
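As a rough illustration of what the listed Claude Opus 4.8 prices mean per request, the sketch below applies the $5.00 and $25.00 per-million-token rates to a token count. The function name and the example token counts are hypothetical, and the calculation assumes flat pricing with no caching, batch, or tiered discounts. No Gemini 3.1 Pro price is listed here, so none is computed.

```python
# Prices for Claude Opus 4.8 from the Specifications table (USD per million tokens).
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 25.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming flat per-token pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
print(f"${estimate_cost_usd(200_000, 4_000):.2f}")  # prints "$1.10"
```

Because output tokens cost five times as much as input tokens at these rates, long generations can dominate the bill even when the prompt is much larger than the response.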

Benchmarks

                          Claude Opus 4.8   Gemini 3.1 Pro
AIME 2024/2025            98%               96%
APEX                      43%               34%
ARC-AGI                   93%               98%
ARC-AGI-2                 72%               77%
FrontierMath              47%               37%
FrontierMath Tier 4       31%               17%
GPQA Diamond              91%               94%
SimpleBench               65%               80%
SimpleQA Verified         40%               77%
WebDev Arena              1552              1461
WeirdML                   83%               72%
GSO (code optimization)   -                 23%
Humanity's Last Exam      -                 46%
METR task horizon         -                 6.4 h
SWE-bench Verified        -                 76%
Terminal-Bench            -                 80%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
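The page does not give the exact formula behind these composites. One plausible reading, sketched below, is to compute a model's percentile rank on each benchmark in an area and then average those ranks over the benchmarks the model has a score for. The function names, the at-or-below percentile definition, the simple mean, and the toy scores are all assumptions for illustration, not the site's actual method or data.

```python
from statistics import mean


def percentile_rank(value, population):
    """Percent of scores in `population` that are at or below `value`."""
    return 100.0 * sum(1 for s in population if s <= value) / len(population)


def composite_percentiles(scores):
    """Average each model's per-benchmark percentile rank.

    `scores` maps benchmark name -> {model name: score}. A model missing
    from a benchmark is simply skipped for that benchmark.
    """
    per_model = {}
    for results in scores.values():
        population = list(results.values())
        for model, value in results.items():
            per_model.setdefault(model, []).append(percentile_rank(value, population))
    return {model: mean(ranks) for model, ranks in per_model.items()}


# Made-up scores for four hypothetical models on two hypothetical benchmarks.
toy_scores = {
    "bench_1": {"model_a": 0.90, "model_b": 0.70, "model_c": 0.50, "model_d": 0.30},
    "bench_2": {"model_a": 0.60, "model_b": 0.80, "model_c": 0.40},
}
composites = composite_percentiles(toy_scores)
for model, score in sorted(composites.items()):
    print(model, round(score, 1))
```

Under this reading, a model with no score on some benchmarks is ranked only on the ones it has, which is one reason a composite can be high even when individual benchmark cells are empty.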

Frequently asked questions

Is Claude Opus 4.8 better than Gemini 3.1 Pro?

On the Epoch Capabilities Index (ECI), Claude Opus 4.8 scores 157 and Gemini 3.1 Pro scores 156, a one-point lead. The right pick depends on your task: compare their coding, math, and reasoning scores in the table above.

Which is better for coding, Claude Opus 4.8 or Gemini 3.1 Pro?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Claude Opus 4.8 ranks higher: 96th vs 71st percentile among the models tracked on Model Beat.


Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.