OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

OpenAI

@OpenAI

Oct 29

Now in research preview: gpt-oss-safeguard Two open-weight reasoning models built for safety classification. openai.com/index/introducing…

Introducing gpt-oss-safeguard

New open safety reasoning models (120b and 20b) that support custom safety policies.

openai.com

162

302

2,050

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

@OpenAI

Oct 29

gpt-oss-safeguard lets developers use their own custom policies to classify content. The model interprets those policies to classify messages, responses, and conversations. These models are fine-tuned versions of our gpt-oss open models, available under Apache 2.0 license. Now on Hugging Face. huggingface.co/collections/o…

gpt-oss-safeguard - a openai Collection

huggingface.co

234

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

@OpenAI

Oct 29

Our gpt-oss-safeguard models outperform gpt-5-thinking and the gpt-oss open models on multi-policy accuracy.

229

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

@OpenAI

Oct 29

We partnered with ROOST to shape this open-weight release, identify developers’ critical needs, test the model, and produce developer documentation. Our cookbook explains how to write policy prompts that maximize gpt-oss-safeguard's reasoning power, choose the right policy length for deep analysis, and integrate oss-safeguard's reasoning outputs into production Trust & Safety systems. cookbook.openai.com/articles…

User guide for gpt-oss-safeguard | OpenAI Cookbook

ROOST and OpenAI have prepared a guide that explains how to write policy prompts that maximize gpt-oss-safeguard's reasoning power, choos...

cookbook.openai.com

184

AI In Practice · Oct 29, 2025 · 2:00 PM UTC

AI In Practice · Oct 29, 2025 · 2:00 PM UTC

AI In Practice

@AIinPractice

Oct 29

Replying to @OpenAI

People are going to learn a ton from how these models reason and handle real-world safety tradeoffs. The cookbook sounds like a must-read.

Oct 29, 2025 · 2:00 PM UTC