← All models
Compare

GLM-5.1 vs Kimi K2.5

GLM-5.1 (Z.ai (Zhipu AI)) and Kimi K2.5 (Moonshot) compared on benchmarks, pricing, context window, and use-case rankings.

GLM-5.1Z.ai (Zhipu AI) · 5 in the newsKimi K2.5Moonshot · 10 in the news
Scores
Intelligence (ECI)150148
Coding6946
Math6353
Reasoning & Knowledge4837
Agentic & Tools39
Specifications
DeveloperZ.ai (Zhipu AI)Moonshot
FamilyKimi
ReleasedApr 7, 2026Feb 2, 2026
Parameters754B1T
AvailabilityOpen weights (unrestricted)
Context window203K262K
Price — $/M input$0.98$0.38
Price — $/M output$3.08$2.02
Inputstexttext, image
Outputstexttext
Benchmarks
AIME 2024/202592%92%
FrontierMath33%28%
FrontierMath Tier 413%4%
GPQA Diamond85%88%
SimpleBench59%47%
SimpleQA Verified37%34%
SWE-bench Verified74%74%
WebDev Arena15341431
WeirdML57%46%
APEX14%
ARC-AGI65%
ARC-AGI-212%
Humanity's Last Exam24%
Terminal-Bench43%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is GLM-5.1 better than Kimi K2.5?

On Epoch AI's Capabilities Index, GLM-5.1 scores higher (150) than Kimi K2.5 (148). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is cheaper, GLM-5.1 or Kimi K2.5?

Kimi K2.5 is cheaper on input tokens at $0.38 per million, versus $0.98 (representative OpenRouter pricing).

Which has a larger context window, GLM-5.1 or Kimi K2.5?

Kimi K2.5 supports up to 262K tokens, compared with 203K for the other.

Which is better for coding, GLM-5.1 or Kimi K2.5?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, GLM-5.1 ranks higher — 69th vs 46th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.