← All models
Compare

GPT-5.5 vs Grok 4

GPT-5.5 (OpenAI) and Grok 4 (xAI) compared on benchmarks, pricing, context window, and use-case rankings.

GPT-5.5OpenAI · 14 in the newsGrok 4xAI
Scores
Intelligence (ECI)159147
Coding8915
Math9722
Reasoning & Knowledge9148
Agentic & Tools9516
Specifications
DeveloperOpenAIxAI
FamilyGPTGrok
ReleasedApr 23, 2026Jul 9, 2025
Parameters3T
AvailabilityAPI accessAPI access
Context window1.1M
Price — $/M input$5.00
Price — $/M output$30.00
Inputsfile, image, text
Outputstext
Benchmarks
AIME 2024/2025100%84%
APEX38%15%
ARC-AGI95%67%
ARC-AGI-285%16%
FrontierMath52%20%
FrontierMath Tier 435%2%
GPQA Diamond94%87%
GSO (code optimization)40%
SimpleBench69%61%
SimpleQA Verified63%48%
SWE-bench Verified81%
Terminal-Bench85%27%
WebDev Arena1505
WeirdML85%46%
GDPval (win/tie rate)24%
METR task horizon1.8 h

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is GPT-5.5 better than Grok 4?

On Epoch AI's Capabilities Index, GPT-5.5 scores higher (159) than Grok 4 (147). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is better for coding, GPT-5.5 or Grok 4?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, GPT-5.5 ranks higher — 89th vs 15th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.