Qwen
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...
Running this yourself: likely needs a high-memory cloud GPU.
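As a rough illustration of why the hardware requirement is high, the sketch below estimates the memory needed just to hold the weights. The parameter counts come from the description above; the bytes-per-parameter figures for each precision are assumptions added here, and the estimate ignores the KV cache, activations, and runtime overhead. In a typical MoE deployment all 235B parameters stay resident even though only 22B are active per forward pass, so the active count mainly reduces compute, not weight memory.

```python
# Rough weight-memory estimate; parameter counts are taken from the card.
TOTAL_PARAMS = 235e9   # all experts are assumed to be resident in memory
ACTIVE_PARAMS = 22e9   # parameters activated per forward pass

# Bytes per parameter for common storage precisions (assumed, not from the card).
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Return the memory needed to hold the weights, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        total = weight_memory_gb(TOTAL_PARAMS, precision)
        active = weight_memory_gb(ACTIVE_PARAMS, precision)
        print(f"{precision}: total {total:.1f} GB, active per pass {active:.1f} GB")
```

Under these assumptions the weights alone exceed the memory of a single accelerator at 16-bit precision, which is consistent with the note above about needing high-memory cloud hardware.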
Quality Score: 60.5
Arena ELO: 1367
Parameters: 235B
Context: 262K
Downloads: 0
Likes: 0
Released: Jul 2025