Compare

Grok 4 vs Kimi K2.5

Add, remove, or swap models to compare them side by side.

Grok 4Kimi K2.5

Grok 4xAIKimi K2.5Moonshot · 10 in the news

Scores

Intelligence (ECI)147148

Coding1546

Math2253

Reasoning & Knowledge4837

Agentic & Tools1639

Specifications

DeveloperxAIMoonshot

FamilyGrokKimi

ReleasedJul 9, 2025Feb 2, 2026

Parameters3T1T

AvailabilityAPI accessOpen weights (unrestricted)

Context window—262K

Price — $/M input—$0.38

Price — $/M output—$2.02

Inputs—text, image

Outputs—text

Benchmarks

AIME 2024/202584%92%

APEX15%14%

ARC-AGI67%65%

ARC-AGI-216%12%

FrontierMath20%28%

FrontierMath Tier 42%4%

GDPval (win/tie rate)24%—

GPQA Diamond87%88%

METR task horizon1.8 h—

SimpleBench61%47%

SimpleQA Verified48%34%

Terminal-Bench27%43%

WeirdML46%46%

Humanity's Last Exam—24%

SWE-bench Verified—74%

WebDev Arena—1431

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.

Grok 4 vs Kimi K2.5

Popular comparisons