← All models
Compare

Gemini 3.1 Pro vs Qwen 3.7 Max

Gemini 3.1 Pro (Google DeepMind) and Qwen 3.7 Max (Alibaba) compared on benchmarks, pricing, context window, and use-case rankings.

Gemini 3.1 ProGoogle DeepMind · 3 in the newsQwen 3.7 MaxAlibaba · 4 in the news
Scores
Intelligence (ECI)155153
Coding6881
Math7778
Reasoning & Knowledge9685
Agentic & Tools8878
Specifications
DeveloperGoogle DeepMindAlibaba
FamilyGeminiQwen
ReleasedFeb 19, 2026May 19, 2026
Parameters
AvailabilityAPI access
Context window1M
Price — $/M input$1.25
Price — $/M output$3.75
Inputstext
Outputstext
Benchmarks
AIME 2024/202596%95%
APEX34%
ARC-AGI98%
ARC-AGI-277%
FrontierMath37%
FrontierMath Tier 417%
GPQA Diamond94%92%
GSO (code optimization)23%
Humanity's Last Exam46%38%
METR task horizon6.4 h
SimpleBench80%70%
SimpleQA Verified77%59%
SWE-bench Verified76%77%
Terminal-Bench80%
WebDev Arena14611541
WeirdML72%
SciCode49%
τ²-bench95%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is Gemini 3.1 Pro better than Qwen 3.7 Max?

On Epoch AI's Capabilities Index, Gemini 3.1 Pro scores higher (155) than Qwen 3.7 Max (153). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is better for coding, Gemini 3.1 Pro or Qwen 3.7 Max?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Qwen 3.7 Max ranks higher — 81th vs 68th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.