Gemini 3.1 Pro vs Qwen3.7 Plus
Gemini 3.1 Pro (Google DeepMind) and Qwen3.7 Plus (Qwen) compared on benchmarks, pricing, context window, and use-case rankings.
- Gemini 3.1 Pro ranks higher for coding (68th vs 65th percentile).
Scores
Use case | Gemini 3.1 Pro | Qwen3.7 Plus
Intelligence (ECI) | 155 | —
Coding | 68 | 65
Math | 77 | —
Reasoning & Knowledge | 96 | 78
Agentic & Tools | 88 | 67
Specifications
Specification | Gemini 3.1 Pro | Qwen3.7 Plus
Developer | Google DeepMind | Qwen
Family | Gemini | Qwen
Released | Feb 19, 2026 | Jun 3, 2026
Parameters | — | —
Availability | API access | —
Context window | — | 1M
Price ($/M input) | — | $0.32
Price ($/M output) | — | $1.28
Inputs | — | text, image
Outputs | — | text
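The per-million-token prices listed for Qwen3.7 Plus can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name and the token counts are hypothetical, and it assumes flat pricing with no tiering, caching discounts, or other provider-specific adjustments. No price is listed for Gemini 3.1 Pro, so it is left out.

```python
# Listed prices for Qwen3.7 Plus, in US dollars per million tokens.
INPUT_PRICE_PER_M = 0.32
OUTPUT_PRICE_PER_M = 1.28


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in dollars of one request (assumes flat pricing)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 200,000 input tokens and 4,000 output tokens.
cost = request_cost(200_000, 4_000)
print(f"${cost:.4f}")
```

Because output tokens cost four times as much as input tokens at these rates, long generations weigh more heavily on the total than long prompts of the same length.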
Benchmarks
Benchmark | Gemini 3.1 Pro | Qwen3.7 Plus
AIME 2024/2025 | 96% | —
APEX | 34% | —
ARC-AGI | 98% | —
ARC-AGI-2 | 77% | —
FrontierMath | 37% | —
FrontierMath Tier 4 | 17% | —
GPQA Diamond | 94% | 90%
GSO (code optimization) | 23% | —
Humanity's Last Exam | 46% | 33%
METR task horizon | 6.4 h | —
SimpleBench | 80% | —
SimpleQA Verified | 77% | —
SWE-bench Verified | 76% | —
Terminal-Bench | 80% | —
WebDev Arena | 1461 | —
WeirdML | 72% | —
SciCode | — | 46%
τ²-bench | — | 93%
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
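The page does not say exactly how the percentile composites are computed. One plausible reading, sketched below, ranks a model against all models on each benchmark in an area and averages those percentile ranks over the benchmarks where the model has a score. The aggregation rule, the function names, and all scores in the example are assumptions made up for illustration; they are not taken from Model Beat or Epoch AI.

```python
from statistics import mean


def percentile_rank(score, all_scores):
    """Percentage of scores in all_scores that are less than or equal to score."""
    return 100.0 * sum(s <= score for s in all_scores) / len(all_scores)


def composite(model, area_scores):
    """Average a model's percentile rank over the benchmarks it has a score for.

    area_scores maps benchmark name -> {model name: score}.
    Returns None when the model has no score in the area.
    """
    ranks = [
        percentile_rank(scores[model], list(scores.values()))
        for scores in area_scores.values()
        if model in scores
    ]
    return round(mean(ranks)) if ranks else None


# Made-up benchmarks and scores, for illustration only.
coding = {
    "bench_x": {"model_a": 70.0, "model_b": 55.0, "model_c": 40.0, "model_d": 62.0},
    "bench_y": {"model_a": 30.0, "model_c": 45.0, "model_d": 20.0},
}

for name in ("model_a", "model_b", "model_e"):
    print(name, composite(name, coding))
```

Under this assumed rule, a model with results on only one benchmark in an area still receives a composite, which would explain how a model can have a use-case score while most individual benchmark cells are empty.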
Frequently asked questions
Which is better for coding, Gemini 3.1 Pro or Qwen3.7 Plus?
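Gemini 3.1 Pro ranks higher for coding: 68th vs 65th percentile among the models tracked on Model Beat. The composite draws on coding benchmarks such as SWE-bench Verified and Terminal-Bench, though the benchmark list above reports those two only for Gemini 3.1 Pro.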
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.