Code Leaderboard

Code generation and understanding

Models

Benchmarks

Providers

Open Weight

Deployable Now

7 self-host

Structured benchmark-backed rows

Signal-backed rows without full normalized tables

Benchmark-expected rows still catching up

This leaderboard still shows tracked models beyond the ones with full benchmark rows. Use the benchmark badge on each row to tell whether the model is Structured, Provider-reported, Arena only, or still Pending benchmark coverage.

LLMs Image Gen Vision Multimodal

#	Model	Capability	Quality	Params	MMLU	GPQA	TerminalBench 2.0	HumanEval	$/M In	Speed	Open
11	Grok Code Fast 1 xAI Subscription