Compare

Gemini 3.1 Pro vs Qwen 3.7 Max

Gemini 3.1 Pro (Google DeepMind) and Qwen 3.7 Max (Alibaba) compared on benchmarks, pricing, context window, and use-case rankings.

Gemini 3.1 Pro leads on overall intelligence (ECI 155 vs 153).
Qwen 3.7 Max ranks higher for coding (81th vs 68th percentile).

Gemini 3.1 ProGoogle DeepMind · 3 in the newsQwen 3.7 MaxAlibaba · 4 in the news

Scores

Intelligence (ECI)155153

Coding6881

Math7778

Reasoning & Knowledge9685

Agentic & Tools8878

Specifications

DeveloperGoogle DeepMindAlibaba

FamilyGeminiQwen

ReleasedFeb 19, 2026May 19, 2026

Parameters——

AvailabilityAPI access—

Context window—1M

Price — $/M input—$1.25

Price — $/M output—$3.75

Inputs—text

Outputs—text

Benchmarks

AIME 2024/202596%95%

APEX34%—

ARC-AGI98%—

ARC-AGI-277%—

FrontierMath37%—

FrontierMath Tier 417%—

GPQA Diamond94%92%

GSO (code optimization)23%—

Humanity's Last Exam46%38%

METR task horizon6.4 h—

SimpleBench80%70%

SimpleQA Verified77%59%

SWE-bench Verified76%77%

Terminal-Bench80%—

WebDev Arena14611541

WeirdML72%—

SciCode—49%

τ²-bench—95%

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Is Gemini 3.1 Pro better than Qwen 3.7 Max?

On Epoch AI's Capabilities Index, Gemini 3.1 Pro scores higher (155) than Qwen 3.7 Max (153). The right pick depends on your task — compare their coding, math, and reasoning scores in the table above.

Which is better for coding, Gemini 3.1 Pro or Qwen 3.7 Max?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Qwen 3.7 Max ranks higher — 81th vs 68th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

More comparisons

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.