← All models
Compare

DeepSeek-V3.2 vs GLM-5.1

DeepSeek-V3.2 (DeepSeek) and GLM-5.1 (Z.ai (Zhipu AI)) compared on benchmarks, pricing, context window, and use-case rankings.

DeepSeek-V3.2DeepSeek · 1 in the newsGLM-5.1Z.ai (Zhipu AI) · 5 in the news
Scores
Intelligence (ECI)146150
Coding2769
Math3663
Reasoning & Knowledge2648
Agentic & Tools29
Specifications
DeveloperDeepSeekZ.ai (Zhipu AI)
FamilyDeepSeek
ReleasedDec 1, 2025Apr 7, 2026
Parameters754B
AvailabilityOpen weights (unrestricted)
Context window131K203K
Price — $/M input$0.23$0.98
Price — $/M output$0.34$3.08
Inputstexttext
Outputstexttext
Benchmarks
AIME 2024/202588%92%
APEX7%
ARC-AGI57%
ARC-AGI-24%
FrontierMath22%33%
FrontierMath Tier 42%13%
GPQA Diamond83%85%
SimpleBench53%59%
SimpleQA Verified28%37%
Terminal-Bench40%
WebDev Arena12861534
WeirdML47%57%
SWE-bench Verified74%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is DeepSeek-V3.2 better than GLM-5.1?

On Epoch AI's Capabilities Index, GLM-5.1 scores higher (150) than DeepSeek-V3.2 (146). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is cheaper, DeepSeek-V3.2 or GLM-5.1?

DeepSeek-V3.2 is cheaper on input tokens at $0.23 per million, versus $0.98 (representative OpenRouter pricing).

Which has a larger context window, DeepSeek-V3.2 or GLM-5.1?

GLM-5.1 supports up to 203K tokens, compared with 131K for the other.

Which is better for coding, DeepSeek-V3.2 or GLM-5.1?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, GLM-5.1 ranks higher — 69th vs 27th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.