BenchmarksMicrosoft Azure2mo ago
Phi-4 Benchmark Update
Quality: 13.2/100 | Price: $0.219/M tokens | Output: 6.894 tok/s | MMLU: 0.714% | HumanEval: 0.231%
View sourceparagekbote
phi-4-reasoning-plus tuned for scalable inference with long context using Unsloth.
Running this yourself: can likely run on your own machine.
This model is still tracked for research and discovery, but it is excluded from default public rankings until it returns to active status.
---
Quality Score
1221
Arena ELO
Unknown
Parameters
---
Context
Sign in to join the discussion
0
Downloads
0
Likes
Dec 2025
Released
Benchmarks
1
Recent launch, pricing, benchmark, and API signals linked to this model or its provider.
Quality: 13.2/100 | Price: $0.219/M tokens | Output: 6.894 tok/s | MMLU: 0.714% | HumanEval: 0.231%
View sourceQuality: 13.2/100 | Price: $0.219/M tokens | Output: 6.894 tok/s | MMLU: 0.714% | HumanEval: 0.231%