Gemini 3.1 Pro vs Kimi K2.5

Gemini 3.1 Pro (Google DeepMind) and Kimi K2.5 (Moonshot) compared on benchmarks, pricing, context window, and use-case rankings.

Gemini 3.1 Pro leads on overall intelligence (ECI 156 vs 148).
Gemini 3.1 Pro ranks higher for coding (71st vs 46th percentile).

Scores

                        Gemini 3.1 Pro   Kimi K2.5
Intelligence (ECI)      156              148
Coding                  71               46
Math                    76               53
Reasoning & Knowledge   96               37
Agentic & Tools         88               39

Specifications

                             Gemini 3.1 Pro    Kimi K2.5
Developer                    Google DeepMind   Moonshot
Family                       Gemini            Kimi
Released                     Feb 19, 2026      Feb 2, 2026
Parameters                   n/a               1T
Availability                 API access        Open weights (unrestricted)
Context window               n/a               262K
Price, input ($/M tokens)    n/a               $0.38
Price, output ($/M tokens)   n/a               $2.02
Inputs                       n/a               text, image
Outputs                      n/a               text
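
As a rough illustration of how the listed prices translate into per-request cost, the sketch below assumes the $/M figures are US dollars per million tokens and uses the Kimi K2.5 prices from the table above. The function name and the token counts are hypothetical, and actual billing may differ by provider.

```python
# Kimi K2.5 prices as listed above, assumed to be USD per million tokens.
INPUT_PRICE_PER_M = 0.38
OUTPUT_PRICE_PER_M = 2.02


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 10,000 output tokens.
    print(f"${estimate_cost(100_000, 10_000):.4f}")  # prints $0.0582
```

No price is listed for Gemini 3.1 Pro, so the same calculation cannot be made for it from this page alone.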

Benchmarks

                          Gemini 3.1 Pro   Kimi K2.5
AIME 2024/2025            96%              92%
APEX                      34%              14%
ARC-AGI                   98%              65%
ARC-AGI-2                 77%              12%
FrontierMath              37%              28%
FrontierMath Tier 4       17%              4%
GPQA Diamond              94%              88%
GSO (code optimization)   23%              n/a
Humanity's Last Exam      46%              24%
METR task horizon         6.4 h            n/a
SimpleBench               80%              47%
SimpleQA Verified         77%              34%
SWE-bench Verified        76%              74%
Terminal-Bench            80%              43%
WebDev Arena              1461             1431
WeirdML                   72%              46%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
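
The page does not spell out how these composites are computed. One plausible reading, sketched below purely as an assumption, is to take a model's percentile rank on each benchmark in an area (the share of tracked models scoring at or below it) and average those ranks. The function names, model names, and scores are all made up for illustration and are not taken from Epoch AI or Model Beat.

```python
from statistics import mean


def percentile_rank(score: float, all_scores: list[float]) -> float:
    """Percent of tracked models scoring at or below `score` (0-100)."""
    at_or_below = sum(1 for s in all_scores if s <= score)
    return 100.0 * at_or_below / len(all_scores)


def composite(model: str, benchmarks: dict[str, dict[str, float]]) -> float:
    """Mean percentile rank over the benchmarks that report a score for `model`."""
    ranks = [
        percentile_rank(scores[model], list(scores.values()))
        for scores in benchmarks.values()
        if model in scores  # skip benchmarks with no result for this model
    ]
    return mean(ranks)


# Made-up scores for three hypothetical models, for illustration only.
toy = {
    "bench_a": {"model_x": 80.0, "model_y": 60.0, "model_z": 40.0},
    "bench_b": {"model_x": 50.0, "model_y": 70.0},
}

if __name__ == "__main__":
    print(composite("model_x", toy))  # 100 on bench_a, 50 on bench_b -> 75.0
```

Under this reading, a benchmark with a missing result (shown as n/a in the table above) is left out of that model's average rather than counted as zero. That is a design choice of the sketch, not something the page confirms.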

Frequently asked questions

Is Gemini 3.1 Pro better than Kimi K2.5?

On the Epoch Capabilities Index (ECI), Gemini 3.1 Pro scores higher (156) than Kimi K2.5 (148). The right pick depends on your task: compare their coding, math, and reasoning scores in the Scores table above.

Which is better for coding, Gemini 3.1 Pro or Kimi K2.5?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Gemini 3.1 Pro ranks higher: 71st vs 46th percentile among the models tracked on Model Beat.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.