Compare

Gemini 3.1 Pro vs Grok Build 0.1

Gemini 3.1 Pro (Google DeepMind) and Grok Build 0.1 (xAI) compared on benchmarks, pricing, context window, and use-case rankings.

Gemini 3.1 ProGoogle DeepMind · 3 in the newsGrok Build 0.1xAI

Scores

Intelligence (ECI)155—

Coding68—

Math77—

Reasoning & Knowledge96—

Agentic & Tools88—

Specifications

DeveloperGoogle DeepMindxAI

FamilyGeminiGrok

ReleasedFeb 19, 2026May 20, 2026

Parameters——

AvailabilityAPI access—

Context window—256K

Price — $/M input—$1.00

Price — $/M output—$2.00

Inputs—text, image

Outputs—text

Benchmarks

AIME 2024/202596%—

APEX34%—

ARC-AGI98%—

ARC-AGI-277%—

FrontierMath37%—

FrontierMath Tier 417%—

GPQA Diamond94%—

GSO (code optimization)23%—

Humanity's Last Exam46%—

METR task horizon6.4 h—

SimpleBench80%—

SimpleQA Verified77%—

SWE-bench Verified76%—

Terminal-Bench80%—

WebDev Arena1461—

WeirdML72%—

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Want a different match-up? Open the compare tool to add or swap models.

More comparisons

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.