grok-4

#220Large Language ModelsProprietary

xAI

Grok 4 is xAI’s most advanced reasoning model. Excels at logical thinking and in-depth analysis. Ideal for insightful discussions and complex problem-solving.

Model updates refreshed3d agoJul 1, 2026news + changelog

Website View Updates

66.3

Quality Score

1443

Arena ELO

Undisclosed

Parameters

256K

Context

Benchmarks and Competitive Signal

Structured

Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.

This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.

MMLUknowledge

86.6

Similar Models

Discussion (0)

Loading comments...

Official Benchmark Evidence

These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.

Grok 4.1 | xAI Docs

Benchmarksprovider-benchmarksMar 2, 2026

Grok 4.1 | xAI Docs Docs Search ⌘ K API Console Products Grok Resources llms.txt Discord Email support Terms and Policies Get Started Welcome Quickstart Models Pricing Release Notes Build New Getting Started Skills, Plugins, and Marketplaces Modes and Commands Headless and Scripting Enterprise Deployments Text Text Generation Reasoning Structured Outputs Streaming Multi Agent Completions (Legacy) Imagine Overview Image Editing Multi-Image Editing Image Generation Video Genera

View source

Arena ELO Ratings

Vision Arena

108 snapshotsArena Rank #70

1182

ELO Score

1175 - 1189

95% Confidence

+/-7 points

32.7K

Battles

Jul 4, 2026

Last Updated

90012001500

Chatbot Arena

108 snapshotsArena Rank #2