Microsoft
Phi-4-reasoning-vision-15B is a open-weight Microsoft specialized model.
Running this yourself: desktop gpu should be enough.
63.9
Quality Score
1221
Arena ELO
15B
Parameters
---
Context
Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.
This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.
Sign in to join the discussion
22.8K
Downloads
156
Likes
Jan 2026
Released
These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.
MAI‑Transcribe‑1 makes speech‑to‑text clearer, faster, and more reliable even in noisy audio. Ranked #1 on the industry-standard FLEURS word error rate benchmark. Now in public preview. Learn more: ht
MAI‑Transcribe‑1 makes speech‑to‑text clearer, faster, and more reliable even in noisy audio. Ranked #1 on the industry-standard FLEURS word error rate benchmark. Now in public preview. Learn more: https://t.co/Gr4Q8jgCwL https://t.co/L6hndn3D34
View sourceMeet MAI‑Image‑2. Built with creatives, for real creative work. Ranked #5 on @arena’s text‑to‑image leaderboard. Available now: https://t.co/qT5nArWDm6 https://t.co/fWoDgxlCf5
Meet MAI‑Image‑2. Built with creatives, for real creative work. Ranked #5 on @arena’s text‑to‑image leaderboard. Available now: https://t.co/qT5nArWDm6 https://t.co/fWoDgxlCf5
View sourceBlackBeenie/Neos-Phi-3-14B-v0.1 — Open LLM Leaderboard #55
Avg: 27.0 | IFEval: 40.2 | BBH: 46.6 | MATH: 17.8 | GPQA: 7.4 | MMLU-PRO: 39.6
View sourcemicrosoft/Phi-4-reasoning-vision-15B · Hugging Face
Microsoft published benchmark or leaderboard evidence for Phi-4-reasoning-vision-15B, Phi-4 Reasoning Vision 15B.
View source1221
ELO Score
1218 - 1225
95% Confidence
+/-4 points
25.2K
Battles
Mar 22, 2026
Last Updated