← All models
Compare

Claude Opus 4.8 vs Grok 4

Add, remove, or swap models to compare them side by side.

Claude Opus 4.8Grok 4
Claude Opus 4.8Anthropic · 1 in the newsGrok 4xAI
Scores
Intelligence (ECI)157147
Coding9615
Math9122
Reasoning & Knowledge7648
Agentic & Tools9416
Specifications
DeveloperAnthropicxAI
FamilyClaudeGrok
ReleasedMay 28, 2026Jul 9, 2025
Parameters3T
AvailabilityAPI accessAPI access
Context window1M
Price — $/M input$5.00
Price — $/M output$25.00
Inputstext, image, file
Outputstext
Benchmarks
AIME 2024/202598%84%
APEX43%15%
ARC-AGI93%67%
ARC-AGI-272%16%
FrontierMath47%20%
FrontierMath Tier 431%2%
GPQA Diamond91%87%
SimpleBench65%61%
SimpleQA Verified40%48%
WebDev Arena1552
WeirdML83%46%
GDPval (win/tie rate)24%
METR task horizon1.8 h
Terminal-Bench27%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.

Claude Opus 4.8 vs Grok 4 — comparison