Qwen3 Next 80B A3B Instruct Benchmark Update
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 188.051 tok/s | MMLU: 0.819% | HumanEval: 0.684%
View sourceQwen
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
Running this yourself: likely needs a high-memory cloud gpu.
24.8
Quality Score
---
Arena ELO
80B
Parameters
262K
Context
Sign in to join the discussion
0
Downloads
0
Likes
Sep 2025
Released
Benchmarks
19
Open Source
1
Recent launch, pricing, benchmark, and API signals linked to this model or its provider.
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 188.051 tok/s | MMLU: 0.819% | HumanEval: 0.684%
View sourceQuality: 13.7/100 | Price: $0.875/M tokens | Output: 188.051 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 188.581 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 185.921 tok/s | MMLU: 0.819% | HumanEval: 0.684%
View sourceQuality: 13.7/100 | Price: $0.875/M tokens | Output: 188.056 tok/s | MMLU: 0.819% | HumanEval: 0.684%
View sourceQuality: 13.7/100 | Price: $0.875/M tokens | Output: 188.168 tok/s | MMLU: 0.819% | HumanEval: 0.684%
View sourceQuality: 13.7/100 | Price: $0.875/M tokens | Output: 188.581 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 185.921 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 188.056 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 188.168 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 194.614 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 188.934 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 193.691 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 190.569 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 194.804 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 193.257 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 191.709 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 187.903 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 185.265 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 186.064 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 184.15 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 180.559 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 168.777 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Quality: 13.7/100 | Price: $0.875/M tokens | Output: 165.754 tok/s | MMLU: 0.819% | HumanEval: 0.684%
Qwen3 Next 80B A3B Instruct is now available through local Ollama runtime. 256K context window listed. The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.