
DeepSeek-V3.2 vs Grok 4

DeepSeek-V3.2 (DeepSeek) and Grok 4 (xAI) compared on benchmarks, pricing, context window, and use-case rankings.

DeepSeek-V3.2 (DeepSeek) | Grok 4 (xAI)
Scores
Intelligence (ECI): 146 | 147
Coding: 27 | 15
Math: 36 | 22
Reasoning & Knowledge: 26 | 48
Agentic & Tools: 29 | 16
Specifications
Developer: DeepSeek | xAI
Family: DeepSeek | Grok
Released: Dec 1, 2025 | Jul 9, 2025
Parameters: 3T
Availability: Open weights (unrestricted) | API access
Context window: 131K
Price, $/M input: $0.23
Price, $/M output: $0.34
Inputs: text
Outputs: text
Benchmarks
AIME 2024/2025: 88% | 84%
APEX: 7% | 15%
ARC-AGI: 57% | 67%
ARC-AGI-2: 4% | 16%
FrontierMath: 22% | 20%
FrontierMath Tier 4: 2% | 2%
GPQA Diamond: 83% | 87%
SimpleBench: 53% | 61%
SimpleQA Verified: 28% | 48%
Terminal-Bench: 40% | 27%
WebDev Arena: 1286
WeirdML: 47% | 46%
GDPval (win/tie rate): 24%
METR task horizon: 1.8 h

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
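The page does not say how these composites are calculated. As a rough illustration only, the sketch below assumes one plausible reading: a model's score on each benchmark is converted to a percentile rank among all models tested on that benchmark, and the ranks are then averaged. The function names, the tie handling, and the sample data (made-up models `a` to `d` on made-up benchmarks) are all hypothetical and are not taken from Model Beat or Epoch AI.

```python
from statistics import mean


def percentile_rank(value, population):
    """Percent of scores strictly below `value`, counting ties as half (assumed convention)."""
    below = sum(1 for s in population if s < value)
    ties = sum(1 for s in population if s == value)
    return 100.0 * (below + 0.5 * ties) / len(population)


def composite(model, scores_by_benchmark):
    """Average the model's percentile rank over the benchmarks it has a score on.

    Returns None when the model has no scores at all.
    """
    ranks = [
        percentile_rank(scores[model], list(scores.values()))
        for scores in scores_by_benchmark.values()
        if model in scores
    ]
    return mean(ranks) if ranks else None


# Hypothetical data: benchmark -> {model: score}. Not real results.
EXAMPLE_SCORES = {
    "bench_x": {"a": 10, "b": 20, "c": 30, "d": 40},
    "bench_y": {"a": 50, "b": 40, "c": 30},
}

if __name__ == "__main__":
    for name in ("a", "b", "c", "d"):
        print(name, composite(name, EXAMPLE_SCORES))
```

Under this assumed scheme, a model missing from a benchmark is simply skipped for that benchmark rather than penalized, so composites built from different benchmark subsets are only loosely comparable.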

Frequently asked questions

Is DeepSeek-V3.2 better than Grok 4?

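On Epoch AI's Capabilities Index (ECI), Grok 4 scores one point higher (147) than DeepSeek-V3.2 (146). The right pick depends on your task: on the use-case scores, DeepSeek-V3.2 leads on coding (27 vs 15), math (36 vs 22), and agentic and tool use (29 vs 16), while Grok 4 leads on reasoning and knowledge (48 vs 26).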

Which is better for coding, DeepSeek-V3.2 or Grok 4?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, DeepSeek-V3.2 ranks higher: 27th vs 15th percentile among the models tracked on Model Beat. On Terminal-Bench specifically, it scores 40% to Grok 4's 27%.


Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.