xAI
Grok 4. 20 Beta is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering consistently precise and truthful responses.
Grok 4.20 Beta is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.
38.9
Quality Score
1251
Arena ELO
Undisclosed
Parameters
2M
Context
Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.
This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.
Sign in to join the discussion
0
Downloads
0
Likes
Mar 2026
Released
These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.
Grok 4 Model Card
benchmarks. Because our models push the frontier of AI capabilities, we are committed to mitigating Our approach to safety evaluations focuses on measuring specific safety-relevant behaviors relevant to our current evaluation methodology, results, and mitigations for these various behaviors.
View source1251
ELO Score
1243 - 1259
95% Confidence
+/-8 points
9.6K
Battles
May 20, 2026
Last Updated