Muse Spark vs Step 3.7 Flash

Muse Spark (Meta AI) and Step 3.7 Flash (StepFun) compared on benchmarks, pricing, context window, and use-case rankings.

Rows with two values list Muse Spark first, then Step 3.7 Flash.

Scores
Intelligence (ECI): 155
Coding: 84 / 32
Math: 69
Reasoning & Knowledge: 86 / 35
Agentic & Tools: 62 / 97

Specifications
Developer: Meta AI / StepFun
Family: not listed
Released: Apr 8, 2026 / May 28, 2026
Parameters: not listed
Availability: API access
Context window: 256K
Price ($/M input): $0.20
Price ($/M output): $1.15
Inputs: text, image, video
Outputs: text

Benchmarks
AIME 2024/2025: 89%
FrontierMath: 39%
FrontierMath Tier 4: 15%
GPQA Diamond: 90% / 81%
Humanity's Last Exam: 41% / 20%
SciCode: 52% / 40%
SimpleQA Verified: 66%
WebDev Arena: 1513
τ²-bench: 92% / 99%
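
To make the per-million-token prices in the specifications concrete, here is a minimal Python sketch that turns them into a per-request cost. The two prices are the ones listed above; the function name and the token counts in the example call are illustrative assumptions, not figures from the page.

```python
# Prices as listed in the specifications (USD per million tokens).
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 1.15


def request_cost(input_tokens, output_tokens):
    """Return the USD cost of one request at the listed per-million prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
cost = request_cost(200_000, 4_000)
print(f"${cost:.4f}")
```

For that hypothetical request the input side costs $0.04 and the output side $0.0046, so output pricing only dominates when responses are long relative to the prompt.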

Use-case scores are 0–100 percentile composites across each area’s benchmarks, ranked against every model from the past year.
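
The page does not spell out how the composite is calculated, so the following Python sketch shows one plausible reading: rank the model against all scored models on each benchmark in an area, then average those percentile ranks. The function names, the "at or below" definition of percentile rank, the plain average, and the placeholder scores are all assumptions made for illustration; none of them come from the page or from Epoch AI.

```python
from statistics import mean


def percentile_rank(value, population):
    """Percent of scores in `population` that are at or below `value`."""
    at_or_below = sum(1 for score in population if score <= value)
    return 100.0 * at_or_below / len(population)


def composite_percentile(model, area_scores):
    """Average a model's percentile rank over the benchmarks it was scored on.

    `area_scores` maps benchmark name -> {model name -> raw score}.
    Returns None when the model has no score in the area.
    """
    ranks = [
        percentile_rank(scores[model], list(scores.values()))
        for scores in area_scores.values()
        if model in scores
    ]
    return round(mean(ranks)) if ranks else None


# Made-up scores for three placeholder models; not taken from the page.
example_area = {
    "benchmark_x": {"model_a": 0.90, "model_b": 0.70, "model_c": 0.50},
    "benchmark_y": {"model_a": 0.40, "model_b": 0.60},
}

print(composite_percentile("model_a", example_area))
```

Under this reading, a model missing from a benchmark is simply skipped for that benchmark, which would explain how a use-case score can exist even when some benchmark cells are empty.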

Frequently asked questions

Which is better for coding, Muse Spark or Step 3.7 Flash?

Across coding benchmarks like SWE-bench Verified and Terminal-Bench, Muse Spark ranks higher: 84th vs 32nd percentile among the models tracked on Model Beat.

Benchmarks & model data from Epoch AI (CC BY); pricing & specs from OpenRouter. ECI = Epoch Capabilities Index.