← All models
Compare

Gemini 3.1 Pro vs Grok Build 0.1

Gemini 3.1 Pro (Google DeepMind) and Grok Build 0.1 (xAI) compared on benchmarks, pricing, context window, and use-case rankings.

Gemini 3.1 ProGoogle DeepMind · 3 in the newsGrok Build 0.1xAI
Scores
Intelligence (ECI)155
Coding68
Math77
Reasoning & Knowledge96
Agentic & Tools88
Specifications
DeveloperGoogle DeepMindxAI
FamilyGeminiGrok
ReleasedFeb 19, 2026May 20, 2026
Parameters
AvailabilityAPI access
Context window256K
Price — $/M input$1.00
Price — $/M output$2.00
Inputstext, image
Outputstext
Benchmarks
AIME 2024/202596%
APEX34%
ARC-AGI98%
ARC-AGI-277%
FrontierMath37%
FrontierMath Tier 417%
GPQA Diamond94%
GSO (code optimization)23%
Humanity's Last Exam46%
METR task horizon6.4 h
SimpleBench80%
SimpleQA Verified77%
SWE-bench Verified76%
Terminal-Bench80%
WebDev Arena1461
WeirdML72%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.