GPT-5.5 vs Grok 4

GPT-5.5 (OpenAI) and Grok 4 (xAI) compared on benchmarks, pricing, context window, and use-case rankings.

GPT-5.5 leads on overall intelligence (ECI 159 vs 147).
GPT-5.5 ranks higher for coding (89th vs 15th percentile).

Scores

| Score | GPT-5.5 | Grok 4 |
| --- | --- | --- |
| Intelligence (ECI) | 159 | 147 |
| Coding | 89 | 15 |
| Math | 97 | 22 |
| Reasoning & Knowledge | 91 | 48 |
| Agentic & Tools | 95 | 16 |

Specifications

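| Specification | GPT-5.5 | Grok 4 |
| --- | --- | --- |
| Developer | OpenAI | xAI |
| Family | GPT | Grok |
| Released | Apr 23, 2026 | Jul 9, 2025 |
| Parameters | n/a | 3T |
| Availability | API access | API access |
| Context window (tokens) | 1.1M | n/a |
| Price, input ($ per 1M tokens) | $5.00 | n/a |
| Price, output ($ per 1M tokens) | $30.00 | n/a |
| Inputs | file, image, text | n/a |
| Outputs | text | n/a |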
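
To make the listed GPT-5.5 prices concrete, here is a minimal sketch of the per-request arithmetic. It assumes billing is linear in token count at the two listed rates; the function name and the example token counts are illustrative and do not come from the source.

```python
# USD per one million tokens, copied from the GPT-5.5 row of the specifications.
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 30.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming purely linear pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 20,000 input tokens and 2,000 output tokens.
# Input: 20,000 * $5 / 1M = $0.10; output: 2,000 * $30 / 1M = $0.06.
cost = request_cost(20_000, 2_000)
print(f"${cost:.2f}")  # prints "$0.16"
```

Output tokens cost six times as much as input tokens at these rates, so long generations dominate the bill even when prompts are large.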

Benchmarks

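| Benchmark | GPT-5.5 | Grok 4 |
| --- | --- | --- |
| AIME 2024/2025 | 100% | 84% |
| APEX | 38% | 15% |
| ARC-AGI | 95% | 67% |
| ARC-AGI-2 | 85% | 16% |
| FrontierMath | 52% | 20% |
| FrontierMath Tier 4 | 35% | 2% |
| GPQA Diamond | 94% | 87% |
| GSO (code optimization) | 40% | n/a |
| SimpleBench | 69% | 61% |
| SimpleQA Verified | 63% | 48% |
| SWE-bench Verified | 81% | n/a |
| Terminal-Bench | 85% | 27% |
| WebDev Arena | 1505 | n/a |
| WeirdML | 85% | 46% |
| GDPval (win/tie rate) | n/a | 24% |
| METR task horizon | n/a | 1.8 h |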

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
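
The exact formula behind these use-case scores is not given here. As a rough sketch of what a percentile composite could look like, the code below assumes one plausible scheme: take a model's percentile rank on each benchmark in an area, then average over the benchmarks it has a score for. The function names, model names, and scores are all hypothetical placeholders, not real results.

```python
from statistics import mean


def percentile_rank(value, population):
    """Percentage of scores in `population` that are less than or equal to `value`."""
    return 100.0 * sum(1 for s in population if s <= value) / len(population)


def composite(model, scores_by_benchmark):
    """Average the model's percentile rank over the benchmarks it was scored on.

    Assumed scheme for illustration only. Returns None if the model has no scores.
    """
    ranks = [
        percentile_rank(scores[model], list(scores.values()))
        for scores in scores_by_benchmark.values()
        if model in scores
    ]
    return mean(ranks) if ranks else None


# Toy data: hypothetical models and scores, with one missing entry (model_c on bench_2).
toy_area = {
    "bench_1": {"model_a": 80.0, "model_b": 40.0, "model_c": 60.0, "model_d": 20.0},
    "bench_2": {"model_a": 70.0, "model_b": 30.0, "model_d": 50.0},
}

for name in ("model_a", "model_b", "model_c", "model_d"):
    print(name, round(composite(name, toy_area), 1))
```

Skipping missing benchmarks rather than counting them as zero keeps a model from being penalized for tests it was never run on, which matters for tables like the one above where several cells are empty.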

Frequently asked questions

Is GPT-5.5 better than Grok 4?

On Epoch AI's Capabilities Index (ECI), GPT-5.5 scores higher (159) than Grok 4 (147). The right pick depends on your task, so compare their coding, math, and reasoning scores in the table above.

Which is better for coding, GPT-5.5 or Grok 4?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, GPT-5.5 ranks higher: 89th vs 15th percentile among the models tracked on Model Beat.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.