Compare

Claude Opus 4.8 vs Gemma 4 26B A4B

Claude Opus 4.8 (Anthropic) and Gemma 4 26B A4B (Google DeepMind) compared on benchmarks, pricing, context window, and use-case rankings.

Claude Opus 4.8 ranks higher for coding (93th vs 26th percentile).

Claude Opus 4.8Anthropic · 1 in the newsGemma 4 26B A4BGoogle DeepMind

Scores

Intelligence (ECI)157—

Coding9326

Math91—

Reasoning & Knowledge754

Agentic & Tools94—

Specifications

DeveloperAnthropicGoogle DeepMind

FamilyClaudeGemma

ReleasedMay 28, 2026Apr 2, 2026

Parameters——

AvailabilityAPI accessOpen weights (unrestricted)

Context window1M—

Price — $/M input$5.00—

Price — $/M output$25.00—

Inputstext, image, file—

Outputstext—

Benchmarks

AIME 2024/202598%—

APEX43%—

ARC-AGI93%—

ARC-AGI-272%—

FrontierMath47%—

FrontierMath Tier 431%—

GPQA Diamond91%—

SimpleBench65%—

SimpleQA Verified40%—

WebDev Arena15521359

WeirdML83%35%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Which is better for coding, Claude Opus 4.8 or Gemma 4 26B A4B?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Claude Opus 4.8 ranks higher — 93th vs 26th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

More comparisons

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.