← All models
Compare

Claude Opus 4.8 vs Kimi K2.5

Claude Opus 4.8 (Anthropic) and Kimi K2.5 (Moonshot) compared on benchmarks, pricing, context window, and use-case rankings.

Claude Opus 4.8Anthropic · 1 in the newsKimi K2.5Moonshot · 10 in the news
Scores
Intelligence (ECI)157148
Coding9646
Math9153
Reasoning & Knowledge7637
Agentic & Tools9439
Specifications
DeveloperAnthropicMoonshot
FamilyClaudeKimi
ReleasedMay 28, 2026Feb 2, 2026
Parameters1T
AvailabilityAPI accessOpen weights (unrestricted)
Context window1M262K
Price — $/M input$5.00$0.38
Price — $/M output$25.00$2.02
Inputstext, image, filetext, image
Outputstexttext
Benchmarks
AIME 2024/202598%92%
APEX43%14%
ARC-AGI93%65%
ARC-AGI-272%12%
FrontierMath47%28%
FrontierMath Tier 431%4%
GPQA Diamond91%88%
SimpleBench65%47%
SimpleQA Verified40%34%
WebDev Arena15521431
WeirdML83%46%
Humanity's Last Exam24%
SWE-bench Verified74%
Terminal-Bench43%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is Claude Opus 4.8 better than Kimi K2.5?

On Epoch AI's Capabilities Index, Claude Opus 4.8 scores higher (157) than Kimi K2.5 (148). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is cheaper, Claude Opus 4.8 or Kimi K2.5?

Kimi K2.5 is cheaper on input tokens at $0.38 per million, versus $5.00 (representative OpenRouter pricing).

Which has a larger context window, Claude Opus 4.8 or Kimi K2.5?

Claude Opus 4.8 supports up to 1M tokens, compared with 262K for the other.

Which is better for coding, Claude Opus 4.8 or Kimi K2.5?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Claude Opus 4.8 ranks higher — 96th vs 46th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.