Skip to main content

Models Deploy Leaderboards Marketplace

Track, rank, and compare every AI model in the world.

Platform

Models
Deploy
Leaderboards
Compare
News
Marketplace
Workspace
Deployments
Discover Watchlists
Pricing

Categories

LLMs
Image Gen
Vision
Multimodal
Embeddings
Speech
Video
Code
Browser Agents
Specialized

Company

About
Roadmap
Contact
FAQ
Providers
API
Terms
Privacy

© 2026 AI Market Cap. All rights reserved.

gpt-oss-safeguard-20b by OpenAI | AI Market Cap

gpt-oss-safeguard-20b

#320Large Language ModelsOpen Weights

O

OpenAI

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

Running this yourself: likely needs a rented cloud gpu.

Model updates refreshed7h agoJun 23, 2026news + changelog

Website View Updates Subscribe

Archived

This model is still tracked for research and discovery, but it is excluded from default public rankings until it returns to active status.

32.3

Quality Score

---

Arena ELO

20B

Parameters

131K

Context

Similar Models

Discussion (0)

Sign in to join the discussion

Loading comments...

0

Downloads

0

Likes

Oct 2025

Released

Launches

4

high

Benchmarks

3

high

Open Source

1

medium

Research

2

low

General

3

low

What Changed Recently

Recent launch, pricing, benchmark, and API signals linked to this model or its provider.

LaunchesOpenAI6d ago

Introducing LifeSciBench

Social & Blog Posts9

BlogOpenAIannouncementgeneral

Research Papers1

arXivOpenAIai2mo ago

Other

ollama-libraryOpenAIopen_sourceopen sourceToday

gpt-oss-safeguard-20b is now available on Ollama

77.8

Llama 4 Maverick#77

qwen3-235b-a22b-instruct-2507#225

LaunchesOpenAI6d ago

We’re sharing new research on a method for anticipating how models may behave in real-world use before release: simulating deployment with recent, de-identified user requests and studying candidate mo

We’re sharing new research on a method for anticipating how models may behave in real-world use before release: simulating deployment with recent, de-identified user requests and studying candidate model responses. https://t.co/7RJzBfNniQ

LaunchesOpenAI1w ago

Predicting model behavior before release by simulating deployment

LaunchesOpenAI1w ago

Introducing the OpenAI Partner Network

BenchmarksOpenAI4d ago

Introducing GPT‑5 for developers

Introducing GPT‑5 for developers | OpenAI Skip to main content Research Products Business Developers Company Foundation (opens in a new window) Log in Try ChatGPT (opens in a new window) Research Products Business Developers Company Foundation (opens in a new window) Try ChatGPT (opens in a new window) Login OpenAI August 7, 2025 Product Introducing GPT‑5 for developers The best model for coding and agentic tasks. Loading… Share Introduction Introduction Coding Frontend engin

Yesterday

Samsung Electronics brings ChatGPT and Codex to employees

#openai#blog#rss

X/Twitter@OpenAIOpenAIresearchresearch4d ago

As AI takes on longer, higher-stakes tasks, we want models to carry beneficial and safe behavior into new domains beyond their training—and maintain it under pressure. That’s the idea behind our new r

As AI takes on longer, higher-stakes tasks, we want models to carry beneficial and safe behavior into new domains beyond their training—and maintain it under pressure. That’s the idea behind our new research on training models to be broadly and persistently beneficial.

#openai#twitter#x

BlogOpenAIannouncementgeneral5d ago

Improving health intelligence in ChatGPT

#openai#blog#rss

Introducing LifeSciBench, a benchmark for measuring and improving how well AI supports real-world life science research. Developed with 173 scientists from biotechnology and pharmaceutical research, L

X/Twitter@OpenAIOpenAIbenchmark5d ago

Introducing LifeSciBench, a benchmark for measuring and improving how well AI supports real-world life science research. Developed with 173 scientists from biotechnology and pharmaceutical research, L

Introducing LifeSciBench, a benchmark for measuring and improving how well AI supports real-world life science research. Developed with 173 scientists from biotechnology and pharmaceutical research, LifeSciBench includes 750 expert-authored tasks across seven biological research https://t.co/JDkKWcnL9F

#openai#twitter#x

BlogOpenAIlaunchlaunch6d ago

Introducing LifeSciBench

#openai#blog#rss

X/Twitter@OpenAIOpenAIlaunchlaunch6d ago

We’re sharing new research on a method for anticipating how models may behave in real-world use before release: simulating deployment with recent, de-identified user requests and studying candidate mo

We’re sharing new research on a method for anticipating how models may behave in real-world use before release: simulating deployment with recent, de-identified user requests and studying candidate model responses. https://t.co/7RJzBfNniQ

#openai#twitter#x

Let’s talk about evals. We’re always looking for better ways to measure and forecast model progress, especially as benchmarks get saturated or gamed. @tejalpatwardhan, who leads our frontier evals tea

X/Twitter@OpenAIOpenAIannouncementgeneral6d ago

Let’s talk about evals. We’re always looking for better ways to measure and forecast model progress, especially as benchmarks get saturated or gamed. @tejalpatwardhan, who leads our frontier evals tea

Let’s talk about evals. We’re always looking for better ways to measure and forecast model progress, especially as benchmarks get saturated or gamed. @tejalpatwardhan, who leads our frontier evals team, spoke to @andrewmayne about why evals matter and what models need to be https://t.co/Q3oRCuNxYB

#openai#twitter#x

BlogOpenAIlaunchlaunch1w ago

Predicting model behavior before release by simulating deployment

#openai#blog#rss

BlogOpenAIlaunchlaunch1w ago

Introducing the OpenAI Partner Network

#openai#blog#rss

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

LLM agents like Claude Code can not only write code but also be used for autonomous AI research and engineering \citep{rank2026posttrainbench, novikov2025alphaevolve}. We show that an \emph{autoresearch}-style pipeline \citep{karpathy2026autoresearch} powered by Claude Code discovers novel white-box adversarial attack \textit{algorithms} that \textbf{significantly outperform all existing (30+) methods} in jailbreaking and prompt injection evaluations. Starting from existing attack implementations, such as GCG~\citep{zou2023universal}, the agent iterates to produce new algorithms achieving up to 40\% attack success rate on CBRN queries against GPT-OSS-Safeguard-20B, compared to $\leq$10\% for existing algorithms (\Cref{fig:teaser}, left). The discovered algorithms generalize: attacks optimized on surrogate models transfer directly to held-out models, achieving \textbf{100\% ASR against Meta-SecAlign-70B} \citep{chen2025secalign} versus 56\% for the best baseline (\Cref{fig:teaser}, middle). Extending the findings of~\cite{carlini2025autoadvexbench}, our results are an early demonstration that incremental safety and security research can be automated using LLM agents. White-box adversarial red-teaming is particularly well-suited for this: existing methods provide strong starting points, and the optimization objective yields dense, quantitative feedback. We release all discovered attacks alongside baseline implementations and evaluation code at https://github.com/romovpa/claudini.

#cs.LG#cs.AI#cs.CR

gpt-oss-safeguard-20b is now available through local Ollama runtime. 128K context window listed. gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built-upon gpt-oss

#deployability#ollama#openai

provider-benchmarksOpenAI4d ago

Introducing GPT‑5 for developers

Introducing GPT‑5 for developers | OpenAI Skip to main content Research Products Business Developers Company Foundation (opens in a new window) Log in Try ChatGPT (opens in a new window) Research Products Business Developers Company Foundation (opens in a new window) Try ChatGPT (opens in a new window) Login OpenAI August 7, 2025 Product Introducing GPT‑5 for developers The best model for coding and agentic tasks. Loading… Share Introduction Introduction Coding Frontend engin

#benchmark#provider-reported#official

provider-benchmarksOpenAI2w ago

Models | OpenAI API

Models | OpenAI API Home API Docs Guides and concepts for the OpenAI API API reference Endpoints, parameters, and responses Codex Docs Guides, concepts, and product docs for Codex Use cases Example workflows and tasks teams hand to Codex ChatGPT Apps SDK Build apps to extend ChatGPT Commerce Build commerce flows in ChatGPT Ads Publish and measure ads in ChatGPT Resources Showcase Demo apps to get inspired Blog Learnings and experiences from developers Cookbook Notebook exampl

#benchmark#provider-reported#official