Compare

Claude Opus 4.8 vs Grok 4

Add, remove, or swap models to compare them side by side.

Claude Opus 4.8Grok 4

Claude Opus 4.8Anthropic · 1 in the newsGrok 4xAI

Scores

Intelligence (ECI)157147

Coding9615

Math9122

Reasoning & Knowledge7648

Agentic & Tools9416

Specifications

DeveloperAnthropicxAI

FamilyClaudeGrok

ReleasedMay 28, 2026Jul 9, 2025

Parameters—3T

AvailabilityAPI accessAPI access

Context window1M—

Price — $/M input$5.00—

Price — $/M output$25.00—

Inputstext, image, file—

Outputstext—

Benchmarks

AIME 2024/202598%84%

APEX43%15%

ARC-AGI93%67%

ARC-AGI-272%16%

FrontierMath47%20%

FrontierMath Tier 431%2%

GPQA Diamond91%87%

SimpleBench65%61%

SimpleQA Verified40%48%

WebDev Arena1552—

WeirdML83%46%

GDPval (win/tie rate)—24%

METR task horizon—1.8 h

Terminal-Bench—27%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.

Claude Opus 4.8 vs Grok 4

Popular comparisons