Code generation and understanding
12
Models
9
Benchmarks
5
Providers
7
Open Weight
9
Deployable Now
7 self-host
10
Structured benchmark-backed rows
0
Signal-backed rows without full normalized tables
0
Benchmark-expected rows still catching up
| # | Model | Capability | Quality | Params | MMLU | GPQA | TerminalBench 2.0 | HumanEval | $/M In | Speed | Open |
|---|---|---|---|---|---|---|---|---|---|---|---|
11 | Grok Code Fast 1 xAI Subscription |
| 60.3 |
| 23.0 |
| Undisclosed |
| 79.3 |
| 72.7 |
| 25.8 |
| 65.7 |
$0.2000/M Cheapest verified |
| — |
5 | CodeLlama 70B Meta Open WeightsStructured | 52.0 | 59.9 | 70B | — | — | — | — | Free Free | — |
7 | Devstral Medium Mistral AI Open WeightsStructured | 39.5 | 23.5 | Unknown | 70.8 | 49.2 | — | 33.7 | $0.4000/M Cheapest verified | — |
1 | GPT-5.1 Codex OpenAI SubscriptionStructured | — | — | Undisclosed | — | — | 57.8 | — | $1.25/M Cheapest verified | — |
4 | Qwen3 Next 80B A3B Instruct Qwen Open WeightsStructured | — | — | 80B | 81.9 | 73.8 | — | 68.4 | $0.0900/M Cheapest verified | — |
6 | Qwen3 Coder 30B A3B Instruct Qwen Open WeightsStructured | — | — | 30B | 70.6 | 51.6 | — | 40.3 | $0.0700/M Cheapest verified | — |
8 | Qwen3 Coder Plus Qwen Open WeightsStructured | — | — | Unknown | — | — | — | — | $0.6500/M Cheapest verified | — |
9 | Qwen3 Coder Flash Qwen Open WeightsStructured | — | — | Unknown | — | — | — | — | $0.1950/M Cheapest verified | — |
12 | qwen-2.5-coder-32b-instruct Qwen Open WeightsStructured | — | — | 32B | — | — | — | — | $0.6600/M Cheapest verified | — |
13 | Codex Mini OpenAI Structured | — | — | Undisclosed | — | — | 43.1 | — | $1.50/M Cheapest verified | — |
15 | codex-remote-for-engineering OpenAI Not standardized | — | — | Undisclosed | — | — | — | — | — | — |
16 | codex-plugin OpenAI Not standardized | — | — | Undisclosed | — | — | — | — | — | — |