← All models
Compare

Grok 4 vs Kimi K2.5

Add, remove, or swap models to compare them side by side.

Grok 4Kimi K2.5
Grok 4xAIKimi K2.5Moonshot · 10 in the news
Scores
Intelligence (ECI)147148
Coding1546
Math2253
Reasoning & Knowledge4837
Agentic & Tools1639
Specifications
DeveloperxAIMoonshot
FamilyGrokKimi
ReleasedJul 9, 2025Feb 2, 2026
Parameters3T1T
AvailabilityAPI accessOpen weights (unrestricted)
Context window262K
Price — $/M input$0.38
Price — $/M output$2.02
Inputstext, image
Outputstext
Benchmarks
AIME 2024/202584%92%
APEX15%14%
ARC-AGI67%65%
ARC-AGI-216%12%
FrontierMath20%28%
FrontierMath Tier 42%4%
GDPval (win/tie rate)24%
GPQA Diamond87%88%
METR task horizon1.8 h
SimpleBench61%47%
SimpleQA Verified48%34%
Terminal-Bench27%43%
WeirdML46%46%
Humanity's Last Exam24%
SWE-bench Verified74%
WebDev Arena1431

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.