Compare

DeepSeek-V3.2 vs Grok 4

DeepSeek-V3.2 (DeepSeek) and Grok 4 (xAI) compared on benchmarks, pricing, context window, and use-case rankings.

Grok 4 leads on overall intelligence (ECI 147 vs 146).
DeepSeek-V3.2 ranks higher for coding (27th vs 15th percentile).

DeepSeek-V3.2DeepSeek · 1 in the newsGrok 4xAI

Scores

Intelligence (ECI)146147

Coding2715

Math3622

Reasoning & Knowledge2648

Agentic & Tools2916

Specifications

DeveloperDeepSeekxAI

FamilyDeepSeekGrok

ReleasedDec 1, 2025Jul 9, 2025

Parameters—3T

AvailabilityOpen weights (unrestricted)API access

Context window131K—

Price — $/M input$0.23—

Price — $/M output$0.34—

Inputstext—

Outputstext—

Benchmarks

AIME 2024/202588%84%

APEX7%15%

ARC-AGI57%67%

ARC-AGI-24%16%

FrontierMath22%20%

FrontierMath Tier 42%2%

GPQA Diamond83%87%

SimpleBench53%61%

SimpleQA Verified28%48%

Terminal-Bench40%27%

WebDev Arena1286—

WeirdML47%46%

GDPval (win/tie rate)—24%

METR task horizon—1.8 h

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is DeepSeek-V3.2 better than Grok 4?

On Epoch AI's Capabilities Index, Grok 4 scores higher (147) than DeepSeek-V3.2 (146). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is better for coding, DeepSeek-V3.2 or Grok 4?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, DeepSeek-V3.2 ranks higher — 27th vs 15th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

More comparisons

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.