Grok 4.20 Beta

#313MultimodalProprietary

xAI

Grok 4. 20 Beta is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering consistently precise and truthful responses.

Model updates refreshed3d agoJul 1, 2026news + changelog

View Updates Subscribe

What changed

Grok 4.20 Beta is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities.

38.0

Quality Score

1253

Arena ELO

Undisclosed

Parameters

Context

Benchmarks and Competitive Signal

Structured

Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.

This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.

TAU-Benchreasoning

17.6

SWE-Benchcoding