← All models
Compare

Claude Opus 4.8 vs Gemini 3.1 Pro

Claude Opus 4.8 (Anthropic) and Gemini 3.1 Pro (Google DeepMind) compared on benchmarks, pricing, context window, and use-case rankings.

Claude Opus 4.8Anthropic · 1 in the newsGemini 3.1 ProGoogle DeepMind · 2 in the news
Scores
Intelligence (ECI)157156
Coding9671
Math9176
Reasoning & Knowledge7696
Agentic & Tools9488
Specifications
DeveloperAnthropicGoogle DeepMind
FamilyClaudeGemini
ReleasedMay 28, 2026Feb 19, 2026
Parameters
AvailabilityAPI accessAPI access
Context window1M
Price — $/M input$5.00
Price — $/M output$25.00
Inputstext, image, file
Outputstext
Benchmarks
AIME 2024/202598%96%
APEX43%34%
ARC-AGI93%98%
ARC-AGI-272%77%
FrontierMath47%37%
FrontierMath Tier 431%17%
GPQA Diamond91%94%
SimpleBench65%80%
SimpleQA Verified40%77%
WebDev Arena15521461
WeirdML83%72%
GSO (code optimization)23%
Humanity's Last Exam46%
METR task horizon6.4 h
SWE-bench Verified76%
Terminal-Bench80%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is Claude Opus 4.8 better than Gemini 3.1 Pro?

On Epoch AI's Capabilities Index, Claude Opus 4.8 scores higher (157) than Gemini 3.1 Pro (156). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is better for coding, Claude Opus 4.8 or Gemini 3.1 Pro?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Claude Opus 4.8 ranks higher — 96th vs 71th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.