GLM-5.1 vs Grok 4
Scores
| Area | GLM-5.1 | Grok 4 |
| --- | --- | --- |
| Intelligence (ECI) | 150 | 147 |
| Coding | 69 | 15 |
| Math | 63 | 22 |
| Reasoning & Knowledge | 48 | 48 |
| Agentic & Tools | — | 16 |
Specifications
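| Specification | GLM-5.1 | Grok 4 |
| --- | --- | --- |
| Developer | Z.ai (Zhipu AI) | xAI |
| Family | — | Grok |
| Released | Apr 7, 2026 | Jul 9, 2025 |
| Parameters | 754B | 3T |
| Availability | — | API access |
| Context window | 203K | — |
| Price ($/M input tokens) | $0.98 | — |
| Price ($/M output tokens) | $3.08 | — |
| Inputs | text | — |
| Outputs | text | — |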
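The per-million-token prices listed for GLM-5.1 can be turned into a per-request estimate with simple arithmetic. The sketch below is illustrative only: the function name `request_cost` and the example token counts are assumptions, and real billing may differ (for example through caching or provider-specific rounding).

```python
# Prices for GLM-5.1 as listed in the specifications (USD per million tokens).
INPUT_PRICE_PER_M = 0.98
OUTPUT_PRICE_PER_M = 3.08


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example token counts are made up for illustration.
cost = request_cost(input_tokens=50_000, output_tokens=2_000)
print(f"${cost:.4f}")  # prints $0.0552
```

No price is listed for Grok 4 here, so the same calculation cannot be made for it from this data.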
Benchmarks
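| Benchmark | GLM-5.1 | Grok 4 |
| --- | --- | --- |
| AIME 2024/2025 | 92% | 84% |
| FrontierMath | 33% | 20% |
| FrontierMath Tier 4 | 13% | 2% |
| GPQA Diamond | 85% | 87% |
| SimpleBench | 59% | 61% |
| SimpleQA Verified | 37% | 48% |
| SWE-bench Verified | 74% | — |
| WebDev Arena | 1534 | — |
| WeirdML | 57% | 46% |
| APEX | — | 15% |
| ARC-AGI | — | 67% |
| ARC-AGI-2 | — | 16% |
| GDPval (win/tie rate) | — | 24% |
| METR task horizon | — | 1.8 h |
| Terminal-Bench | — | 27% |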
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
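To make the idea of a percentile composite concrete, here is a minimal sketch. The page does not say how percentiles are computed or combined, so this assumes a "share of models scoring at or below" rank and a plain average over the benchmarks a model has results for. All names and numbers in it are hypothetical.

```python
from statistics import mean


def percentile_rank(value, population):
    """Percentage (0-100) of population scores that are <= value."""
    if not population:
        raise ValueError("population must not be empty")
    at_or_below = sum(1 for score in population if score <= value)
    return 100.0 * at_or_below / len(population)


def composite(model_scores, field_scores):
    """Average a model's percentile ranks, skipping benchmarks with no score."""
    ranks = [
        percentile_rank(score, field_scores[bench])
        for bench, score in model_scores.items()
        if score is not None and bench in field_scores
    ]
    return mean(ranks) if ranks else None


# Made-up scores for all models on two benchmarks (not real data).
field = {"bench_a": [10, 20, 30, 40], "bench_b": [50, 60, 70, 80]}
# Made-up scores for one model; bench_c has no result and is skipped.
model = {"bench_a": 30, "bench_b": 60, "bench_c": None}

print(composite(model, field))  # prints 62.5
```

Under this assumption a missing benchmark (shown as "—" on the page) does not count against a model, while a model with no results in an area gets no composite at all.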
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.