o1

Name: o1
Author: OpenAI

Large Language Models, Proprietary


Previous full o-series reasoning model. It works through an extended internal chain of thought before responding and remains strong at science and math, but the later o3 and o4 releases are the newer frontier generation.

Model updates refreshed Jul 5, 2026 (news + changelog)


Lifecycle

Previous full o-series reasoning model.

---

Quality Score

Arena ELO: 1350
Parameters: Undisclosed
Context: 200K tokens

Benchmarks and Competitive Signal


Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.

This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.

LiveBench Coding (category: coding)


Official Benchmark Evidence

These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.

o1-2024-12-17 - Arena-Hard-Auto

Benchmarks | arena-hard-auto | Jul 5, 2026

Arena-Hard-Auto official score (judged by Gemini-2.5): 55.9, with CI -2.2/+1.8
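
The asymmetric interval reported for Arena-Hard-Auto can be turned into explicit bounds with a few lines of Python. This is a minimal sketch: the helper name `ci_bounds` is hypothetical, and it assumes that "-2.2/1.8" means 2.2 points below and 1.8 points above the reported score.

```python
def ci_bounds(score: float, minus: float, plus: float) -> tuple[float, float]:
    """Return (lower, upper) bounds for a score with an asymmetric interval.

    Assumption: `minus` and `plus` are the distances below and above the score.
    """
    # Round to one decimal to match the precision of the reported figures.
    return (round(score - minus, 1), round(score + plus, 1))


# Values from the Arena-Hard-Auto entry: score 55.9, CI -2.2/+1.8.
lower, upper = ci_bounds(55.9, 2.2, 1.8)
print(f"{lower} to {upper}")
```

Under that reading, the interval spans roughly four points, so small differences between nearby models on this benchmark should be treated with caution.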

https://huggingface.co/agentica-org/DeepSWE-Preview - SWE-Bench Verified

Benchmarks | swe-bench | Jul 5, 2026

SWE-Bench Verified resolved rate: 58.8

We’re introducing GeneBench-Pro, a research-level benchmark for a harder kind of AI progress: how well agents can navigate messy biological data, choose the right analysis path, and make judgment call

Benchmarks | x-twitter | Jun 30, 2026

Introducing GPT‑5 for developers

Benchmarks | provider-benchmarks | Jun 26, 2026

OpenAI, August 7, 2025 (Product). Introducing GPT‑5 for developers: the best model for coding and agentic tasks.

https://huggingface.co/agentica-org/DeepSWE-Preview - SWE-Bench Verified

Benchmarks | swe-bench | Mar 13, 2026

SWE-Bench Verified resolved rate: 58.8

o1-2024-12-17 - Arena-Hard-Auto

Benchmarks | arena-hard-auto | Mar 13, 2026

Arena-Hard-Auto official score (judged by Gemini-2.5): 55.9, with CI -2.2/+1.8

Arena ELO Ratings

Chatbot Arena

Arena Rank: #28 (102 snapshots)
ELO Score: 1350
95% Confidence: 1347 - 1354 (+/-4 points)
Battles: 33.2K
Last Updated: Jul 5, 2026
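
To give the rating and its interval some intuition, the sketch below applies the textbook Elo expected-score formula and recomputes the half-width of the 95% interval. It is illustrative only: the function names are hypothetical, the 1300-rated opponent is made up for the example, and the leaderboard may fit its ratings with a different statistical model than classic Elo.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Textbook Elo expected score of A against B (400-point logistic scale)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def half_width(lower: float, upper: float) -> float:
    """Half the width of a confidence interval."""
    return (upper - lower) / 2.0


# Chatbot Arena figures from this page: rating 1350, 95% interval 1347 - 1354.
print(half_width(1347, 1354))  # 3.5, consistent with the displayed +/-4

# Hypothetical opponent rated 1300 (not taken from the page).
print(round(expected_score(1350, 1300), 3))
```

Under classic Elo, a 50-point gap corresponds to only a modest edge in head-to-head preference, which is why the narrow interval here matters more for ranking than for practical differences.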

Vision Arena

Arena Rank: #64 (99 snapshots)
ELO Score: 1193
95% Confidence: 1182 - 1204 (+/-11 points)
Battles: 3.7K
Last Updated: Jul 5, 2026