Compare
Grok 4 vs Kimi K2.5
Add, remove, or swap models to compare them side by side.
Scores
Intelligence (ECI)147148
Coding1546
Math2253
Reasoning & Knowledge4837
Agentic & Tools1639
Specifications
DeveloperxAIMoonshot
FamilyGrokKimi
ReleasedJul 9, 2025Feb 2, 2026
Parameters3T1T
AvailabilityAPI accessOpen weights (unrestricted)
Context window—262K
Price — $/M input—$0.38
Price — $/M output—$2.02
Inputs—text, image
Outputs—text
Benchmarks
AIME 2024/202584%92%
APEX15%14%
ARC-AGI67%65%
ARC-AGI-216%12%
FrontierMath20%28%
FrontierMath Tier 42%4%
GDPval (win/tie rate)24%—
GPQA Diamond87%88%
METR task horizon1.8 h—
SimpleBench61%47%
SimpleQA Verified48%34%
Terminal-Bench27%43%
WeirdML46%46%
Humanity's Last Exam—24%
SWE-bench Verified—74%
WebDev Arena—1431
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.
Popular comparisons
- Claude Opus 4.8 vs GPT-5.5
- Gemini 3.1 Pro vs GPT-5.5
- GLM-5.1 vs GPT-5.5
- GPT-5.5 vs Kimi K2.5
- GPT-5.5 vs Grok 4
- DeepSeek-V3.2 vs GPT-5.5
- Claude Opus 4.8 vs Gemini 3.1 Pro
- Claude Opus 4.8 vs GLM-5.1
- Claude Opus 4.8 vs Kimi K2.5
- Claude Opus 4.8 vs Grok 4
- Claude Opus 4.8 vs DeepSeek-V3.2
- Gemini 3.1 Pro vs GLM-5.1
- Gemini 3.1 Pro vs Kimi K2.5
- Gemini 3.1 Pro vs Grok 4
- DeepSeek-V3.2 vs Gemini 3.1 Pro
- GLM-5.1 vs Kimi K2.5
- GLM-5.1 vs Grok 4
- DeepSeek-V3.2 vs GLM-5.1
- Grok 4 vs Kimi K2.5
- DeepSeek-V3.2 vs Kimi K2.5
- DeepSeek-V3.2 vs Grok 4
- GLM-5 vs GLM-5.1
- GPT-5 vs GPT-5.5
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.