Compare
Claude Opus 4.8 vs Gemini 3.1 Pro
Add, remove, or swap models to compare them side by side.
Scores
Intelligence (ECI)157156
Coding9671
Math9176
Reasoning & Knowledge7696
Agentic & Tools9488
Specifications
DeveloperAnthropicGoogle DeepMind
FamilyClaudeGemini
ReleasedMay 28, 2026Feb 19, 2026
Parameters——
AvailabilityAPI accessAPI access
Context window1M—
Price — $/M input$5.00—
Price — $/M output$25.00—
Inputstext, image, file—
Outputstext—
Benchmarks
AIME 2024/202598%96%
APEX43%34%
ARC-AGI93%98%
ARC-AGI-272%77%
FrontierMath47%37%
FrontierMath Tier 431%17%
GPQA Diamond91%94%
SimpleBench65%80%
SimpleQA Verified40%77%
WebDev Arena15521461
WeirdML83%72%
GSO (code optimization)—23%
Humanity's Last Exam—46%
METR task horizon—6.4 h
SWE-bench Verified—76%
Terminal-Bench—80%
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.
Popular comparisons
- Claude Opus 4.8 vs GPT-5.5
- Gemini 3.1 Pro vs GPT-5.5
- GLM-5.1 vs GPT-5.5
- GPT-5.5 vs Kimi K2.5
- GPT-5.5 vs Grok 4
- DeepSeek-V3.2 vs GPT-5.5
- Claude Opus 4.8 vs Gemini 3.1 Pro
- Claude Opus 4.8 vs GLM-5.1
- Claude Opus 4.8 vs Kimi K2.5
- Claude Opus 4.8 vs Grok 4
- Claude Opus 4.8 vs DeepSeek-V3.2
- Gemini 3.1 Pro vs GLM-5.1
- Gemini 3.1 Pro vs Kimi K2.5
- Gemini 3.1 Pro vs Grok 4
- DeepSeek-V3.2 vs Gemini 3.1 Pro
- GLM-5.1 vs Kimi K2.5
- GLM-5.1 vs Grok 4
- DeepSeek-V3.2 vs GLM-5.1
- Grok 4 vs Kimi K2.5
- DeepSeek-V3.2 vs Kimi K2.5
- DeepSeek-V3.2 vs Grok 4
- GLM-5 vs GLM-5.1
- GPT-5 vs GPT-5.5
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.