Gemini 3.1 Pro vs GLM-5.2
Gemini 3.1 Pro from Google DeepMind and GLM-5.2 from Z.ai (Zhipu AI), compared on benchmarks, pricing, context window, and use-case rankings.
- GLM-5.2 ranks higher for coding (91st vs 68th percentile).
Scores
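| Area | Gemini 3.1 Pro | GLM-5.2 |
| --- | --- | --- |
| Intelligence (ECI) | 156 | n/a |
| Coding | 68 | 91 |
| Math | 76 | 33 |
| Reasoning & Knowledge | 96 | 70 |
| Agentic & Tools | 88 | n/a |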
Specifications
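| Specification | Gemini 3.1 Pro | GLM-5.2 |
| --- | --- | --- |
| Developer | Google DeepMind | Z.ai (Zhipu AI) |
| Family | Gemini | n/a |
| Released | Feb 19, 2026 | Jun 16, 2026 |
| Parameters | n/a | n/a |
| Availability | API access | Open weights (unrestricted) |
| Context window | n/a | 1M |
| Price, $/M input | n/a | $0.95 |
| Price, $/M output | n/a | $3.00 |
| Inputs | n/a | text |
| Outputs | n/a | text |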
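As a rough illustration of what the listed GLM-5.2 prices mean in practice, the sketch below estimates the cost of a single request. It assumes the rates are US dollars per million tokens, as the "$/M" labels suggest; the function name `estimate_cost` and the token counts are hypothetical and chosen only for the example.

```python
# GLM-5.2 prices from the Specifications table (assumed USD per million tokens).
INPUT_PRICE_PER_M = 0.95
OUTPUT_PRICE_PER_M = 3.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed per-million rates."""
    input_cost = (input_tokens / 1_000_000) * INPUT_PRICE_PER_M
    output_cost = (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical workload: 200k input tokens and 20k output tokens.
cost = estimate_cost(200_000, 20_000)
print(f"${cost:.4f}")
```

No price is listed for Gemini 3.1 Pro on this page, so the same calculation cannot be repeated for it from these figures.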
Benchmarks
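| Benchmark | Gemini 3.1 Pro | GLM-5.2 |
| --- | --- | --- |
| AIME 2024/2025 | 96% | 86% |
| APEX | 34% | n/a |
| ARC-AGI | 98% | n/a |
| ARC-AGI-2 | 77% | n/a |
| FrontierMath | 37% | n/a |
| FrontierMath Tier 4 | 17% | n/a |
| GPQA Diamond | 94% | 92% |
| GSO (code optimization) | 23% | n/a |
| Humanity's Last Exam | 46% | n/a |
| METR task horizon | 6.4 h | n/a |
| SimpleBench | 80% | n/a |
| SimpleQA Verified | 77% | 38% |
| SWE-bench Verified | 76% | 79% |
| Terminal-Bench | 80% | n/a |
| WebDev Arena | 1461 | 1593 |
| WeirdML | 72% | 70% |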
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
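The page does not spell out how these composites are calculated. The following is a minimal sketch of one plausible reading, assuming a model's percentile on each benchmark is the share of tracked models scoring at or below it, and that the composite is the plain average of those percentiles. The function names, benchmark names, and scores are all hypothetical and are not taken from the site.

```python
from statistics import mean


def percentile_rank(score: float, all_scores: list[float]) -> float:
    """Percent of tracked models whose score is at or below `score` (0-100)."""
    at_or_below = sum(1 for s in all_scores if s <= score)
    return 100.0 * at_or_below / len(all_scores)


def composite(model_scores: dict[str, float], field: dict[str, list[float]]) -> float:
    """Average the model's percentile ranks over the benchmarks it was scored on."""
    ranks = [
        percentile_rank(score, field[name])
        for name, score in model_scores.items()
        if name in field
    ]
    return mean(ranks)


# Hypothetical data: each list holds every tracked model's score on that benchmark.
field = {
    "bench_a": [40.0, 55.0, 70.0, 79.0],
    "bench_b": [10.0, 20.0, 30.0, 40.0],
}
model = {"bench_a": 79.0, "bench_b": 30.0}
print(composite(model, field))
```

Under this assumption, a model with missing benchmark results is averaged only over the benchmarks it has, which is one reason composite rankings and individual benchmark scores can point in different directions.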
Frequently asked questions
Which is better for coding, Gemini 3.1 Pro or GLM-5.2?
Across coding benchmarks such as SWE-bench Verified and Terminal-Bench, GLM-5.2 ranks higher: 91st vs 68th percentile among the models tracked on Model Beat.
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.