Skip to main content

Models Deploy Leaderboards Marketplace

Track, rank, and compare every AI model in the world.

Platform

Models
Deploy
Leaderboards
Compare
News
Marketplace
Workspace
Deployments
Discover Watchlists
Pricing

Categories

LLMs
Image Gen
Vision
Multimodal
Embeddings
Speech
Video
Code
Browser Agents
Specialized

Company

About
Roadmap
Contact
FAQ
Providers
API
Terms
Privacy

© 2026 AI Market Cap. All rights reserved.

Step 3.5 Flash by Stepfun | AI Market Cap

Step 3.5 Flash

Large Language ModelsProprietary

S

Stepfun

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....

Model updates refreshed6h agoJun 23, 2026news + changelog

What changed

Step 3.5 Flash is StepFun's most capable open-source foundation model.

Archived

This model is still tracked for research and discovery, but it is excluded from default public rankings until it returns to active status.

---

Quality Score

1146

Arena ELO

11B

Parameters

262K

Context

Similar Models

Discussion (0)

Sign in to join the discussion

Loading comments...

0

Downloads

0

Likes

Jan 2026

Released

Benchmarks

19

high

Research

1

low

What Changed Recently

Recent launch, pricing, benchmark, and API signals linked to this model or its provider.

BenchmarksStepFunToday

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 201.967 tok/s | HumanEval: 0.404%

BenchmarksStepFun

Benchmarks & Rankings19

BenchmarksStepFunToday

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 201.967 tok/s | HumanEval: 0.404%

Research Papers1

HF PapersQwenresearch1w ago

77.8

Llama 4 Maverick#77

qwen3-235b-a22b-instruct-2507#225

Today

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 201.967 tok/s | HumanEval: 0.404%

BenchmarksStepFunYesterday

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 194.446 tok/s | HumanEval: 0.404%

BenchmarksStepFun2d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 194.446 tok/s | HumanEval: 0.404%

BenchmarksStepFun3d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 197.655 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFunToday

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 201.967 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFunYesterday

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 194.446 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun2d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 194.446 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun3d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 197.655 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun4d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 212.801 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun5d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 210.122 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun6d ago

Step 3.5 Flash Benchmark Update

Quality: 25.5/100 | Price: $0.15/M tokens | Output: 189.483 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 177.909 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 176.358 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 180.424 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 197.434 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 215.94 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 206.606 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun1w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 171.835 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun2w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 164.267 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun2w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 177.967 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun2w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 190.615 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

BenchmarksStepFun2w ago

Step 3.5 Flash Benchmark Update

Quality: 37.8/100 | Price: $0.15/M tokens | Output: 215.92 tok/s | HumanEval: 0.404%

#benchmark#pricing#artificial-analysis

OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation

Memory has become a standard substrate for self-evolving agents, yet retaining experience is not the same as learning how to evolve through it. Existing memory agents can store trajectories, retrieve reflections, or accumulate skills, but often lack the holistic competence to select useful experience, act on it, write reusable knowledge, and maintain a growing repository. We introduce OPD-Evolver, a slow-fast co-evolution framework that cultivates such an agent evolver through on-policy self-distillation. In the fast loop, OPD-Evolver interacts with a four-level memory hierarchy to read, use, write, and maintain experience for rapid test-time evolution. In the slow loop, outcome-calibrated memory attribution and privileged hindsight distill these four abilities into the deployable policy. Across multi-domain benchmarks, OPD-Evolver surpasses memory systems such as ReasoningBank by up to 11.5%, and training-based methods such as Skill0 by ~5.8%. Further analysis shows that OPD-Evolver internalizes high-value experience and memory management, enabling OPD-Evolver-9B to challenge giant counterparts such as Qwen3.5-397B-A17B and Step-3.5-Flash, pointing beyond memory-augmented agents toward genuinely qualified agent evolvers.

#huggingface#daily-papers