Compare

Gemini 3.1 Pro vs Qwen3.5-9B

Gemini 3.1 Pro (Google DeepMind) and Qwen3.5-9B (Alibaba) compared on benchmarks, pricing, context window, and use-case rankings.

Gemini 3.1 Pro ranks higher for coding (68th vs 2th percentile).

Gemini 3.1 ProGoogle DeepMind · 3 in the newsQwen3.5-9BAlibaba

Scores

Intelligence (ECI)156—

Coding682

Math76—

Reasoning & Knowledge96—

Agentic & Tools882

Specifications

DeveloperGoogle DeepMindAlibaba

FamilyGeminiQwen

ReleasedFeb 19, 2026Feb 24, 2026

Parameters——

AvailabilityAPI accessOpen weights (restricted use)

Context window—262K

Price — $/M input—$0.10

Price — $/M output—$0.15

Inputs—text, image, video

Outputs—text

Benchmarks

AIME 2024/202596%—

APEX34%—

ARC-AGI98%—

ARC-AGI-277%—

FrontierMath37%—

FrontierMath Tier 417%—

GPQA Diamond94%—

GSO (code optimization)23%—

Humanity's Last Exam46%—

METR task horizon6.4 h—

SimpleBench80%—

SimpleQA Verified77%—

SWE-bench Verified76%—

Terminal-Bench80%9%

WebDev Arena1461—

WeirdML72%—

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.

Frequently asked questions

Which is better for coding, Gemini 3.1 Pro or Qwen3.5-9B?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Gemini 3.1 Pro ranks higher — 68th vs 2th percentile among the models tracked on Model Beat.

Want a different match-up? Open the compare tool to add or swap models.

More comparisons

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.