o3

Name: o3
Price: 2 USD
Availability: InStock
Rating: 48.6 (1 reviews)
Author: OpenAI

#41Large Language ModelsProprietary

OpenAI

OpenAI's most powerful reasoning model. Achieves state-of-the-art results on complex math, science, and code tasks through extended chain-of-thought.

Model updates refreshed3h agoJul 4, 2026news + changelog

Website View Updates Get API Access

What changed

Achieves state-of-the-art results on complex math, science, and code tasks through extended chain-of-thought.

48.6

Quality Score

1340

Arena ELO

Undisclosed

Parameters

200K

Context

Benchmarks and Competitive Signal

Structured

Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.

This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.

HumanEvalcoding

Similar Models

Discussion (0)

Loading comments...

Official Benchmark Evidence

These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.

o3-pro Benchmark Update

Benchmarksartificial-analysisJul 4, 2026

Quality: 32.5/100 | Price: $35/M tokens | Output: 30.694 tok/s

View source

o3 Benchmark Update

Benchmarksartificial-analysisJul 4, 2026

Quality: 30.4/100 | Price: $3.5/M tokens | Output: 135.319 tok/s | MMLU: 0.853% | HumanEval: 0.808%

View source

o3-mini-2025-01-31 - Arena-Hard-Auto

Benchmarksarena-hard-autoJul 4, 2026

Arena-Hard-Auto official Gemini-2.5 judged score 50.0 with CI 0/0

View source

o3-20250416 - SWE-Bench Verified

Benchmarksswe-benchJul 4, 2026

SWE-Bench Verified resolved rate 58.4

View source

O3-Mini-2025-01-31 (Low) - LiveCodeBench

BenchmarkslivecodebenchJul 4, 2026

LiveCodeBench pass@1 70.6 across 1055 tasks

View source

o3-pro Benchmark Update

Benchmarksartificial-analysisJul 3, 2026

Quality: 32.5/100 | Price: $35/M tokens | Output: 27.436 tok/s

View source

Arena ELO Ratings

Vision Arena

110 snapshotsArena Rank #51

1217

ELO Score

1210 - 1224

95% Confidence

+/-7 points

46.6K

Battles

Jul 4, 2026

Last Updated

90012001500

Chatbot Arena

110 snapshotsArena Rank #31

1340

ELO Score

1337 - 1345

95% Confidence

+/-4 points

19.4K

Battles

Jul 4, 2026

Last Updated

90012001500