Compare
DeepSeek-V4-Pro vs GPT-5.5
Add, remove, or swap models to compare them side by side.
Scores
Intelligence (ECI)150159
Coding7691
Math9197
Reasoning & Knowledge6892
Agentic & Tools9287
Specifications
DeveloperDeepSeekOpenAI
FamilyDeepSeekGPT
ReleasedApr 24, 2026Apr 23, 2026
Parameters1.6T—
AvailabilityOpen weights (unrestricted)API access
Context window1M1.1M
Price — $/M input$0.43$5.00
Price — $/M output$0.87$30.00
Inputstextfile, image, text
Outputstexttext
Benchmarks
AIME 2024/202597%100%
GPQA Diamond90%94%
Humanity's Last Exam36%44%
SciCode50%56%
SimpleBench61%69%
SimpleQA Verified57%63%
SWE-bench Verified78%81%
WebDev Arena14641505
WeirdML49%85%
τ²-bench96%94%
APEX—38%
ARC-AGI—95%
ARC-AGI-2—85%
FrontierMath—52%
FrontierMath Tier 4—35%
GSO (code optimization)—40%
Terminal-Bench—85%
Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year. Highlighted cells lead each row. Open a model for the full picture.
Popular comparisons
- Claude Fable 5 vs GPT-5.5
- Claude Fable 5 vs Gemini 3.1 Pro
- Claude Fable 5 vs Muse Spark
- Claude Fable 5 vs Qwen 3.7 Max
- Claude Fable 5 vs GLM-5.2
- Claude Fable 5 vs Kimi K2.6
- Claude Fable 5 vs DeepSeek-V4-Pro
- Gemini 3.1 Pro vs GPT-5.5
- GPT-5.5 vs Muse Spark
- GPT-5.5 vs Qwen 3.7 Max
- GLM-5.2 vs GPT-5.5
- GPT-5.5 vs Kimi K2.6
- DeepSeek-V4-Pro vs GPT-5.5
- Gemini 3.1 Pro vs Muse Spark
- Gemini 3.1 Pro vs Qwen 3.7 Max
- Gemini 3.1 Pro vs GLM-5.2
- Gemini 3.1 Pro vs Kimi K2.6
- DeepSeek-V4-Pro vs Gemini 3.1 Pro
- Muse Spark vs Qwen 3.7 Max
- GLM-5.2 vs Muse Spark
- Kimi K2.6 vs Muse Spark
- DeepSeek-V4-Pro vs Muse Spark
- GLM-5.2 vs Qwen 3.7 Max
- Kimi K2.6 vs Qwen 3.7 Max
- DeepSeek-V4-Pro vs Qwen 3.7 Max
- GLM-5.2 vs Kimi K2.6
- DeepSeek-V4-Pro vs GLM-5.2
- DeepSeek-V4-Pro vs Kimi K2.6
- GLM-5 vs GLM-5.2
- GPT-5 vs GPT-5.5
- Kimi K2.5 vs Kimi K2.6
- Kimi K2 vs Kimi K2.6
Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.