gpt-oss-safeguard lets developers use their own custom policies to classify content. The model interprets those policies to classify messages, responses, and conversations. These models are fine-tuned versions of our gpt-oss open models, available under the Apache 2.0 license. Now on Hugging Face. huggingface.co/collections/o…
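For a concrete picture of how that works, here is a minimal sketch, assuming the model is served behind an OpenAI-compatible endpoint (for example, a local vLLM server). The endpoint URL, the policy text, the model id, and the output format below are illustrative assumptions for the example, not official values.

```python
# Minimal sketch: classify content against a custom policy with
# gpt-oss-safeguard. Assumes an OpenAI-compatible server (e.g., vLLM)
# running locally; URL, policy, and label format are illustrative.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# The custom policy goes in the system message; the model reasons over
# it to classify the user-supplied content.
POLICY = """\
# Spam Policy
Classify the content as VIOLATING or NON-VIOLATING.
VIOLATING: unsolicited bulk promotion, deceptive links, engagement bait.
NON-VIOLATING: everything else, including genuine product questions.
Answer on one line as LABEL: one-sentence rationale.
"""

response = client.chat.completions.create(
    model="openai/gpt-oss-safeguard-20b",  # assumed model id for the sketch
    messages=[
        {"role": "system", "content": POLICY},
        {"role": "user", "content": "WIN A FREE PHONE!!! Click now -> bit.ly/xyz"},
    ],
)

print(response.choices[0].message.content)
```

Because the policy is just a prompt, swapping in a different policy changes the classifier's behavior without any retraining.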
Our gpt-oss-safeguard models outperform gpt-5-thinking and the gpt-oss open models on multi-policy accuracy.
We partnered with ROOST to shape this open-weight release, identify developers’ critical needs, test the model, and produce developer documentation. Our cookbook explains how to write policy prompts that maximize gpt-oss-safeguard's reasoning power, choose the right policy length for deep analysis, and integrate gpt-oss-safeguard's reasoning outputs into production Trust & Safety systems. cookbook.openai.com/articles…
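On that last point, integrating reasoning outputs into a production pipeline, here is a hedged sketch of what the downstream side might look like. It assumes the policy asked the model for a one-line `LABEL: rationale` answer (as in the sketch above); `parse_verdict` and `route` are hypothetical helpers for illustration, not cookbook APIs.

```python
# Hedged sketch: wiring a policy classifier's output into a Trust &
# Safety pipeline. Label format and helper names are assumptions.
from dataclasses import dataclass

@dataclass
class Verdict:
    label: str       # e.g. "VIOLATING" or "NON-VIOLATING"
    rationale: str   # the model's one-sentence explanation

def parse_verdict(model_output: str) -> Verdict:
    # Assumes the policy asked for "LABEL: rationale" on one line.
    label, _, rationale = model_output.partition(":")
    return Verdict(label.strip(), rationale.strip())

def route(verdict: Verdict) -> str:
    # Example routing only: escalate violations, pass everything else.
    if verdict.label == "VIOLATING":
        return "queue_for_human_review"
    return "allow"

verdict = parse_verdict("VIOLATING: unsolicited bulk promotion with a deceptive link.")
print(route(verdict), "-", verdict.rationale)
```

Keeping the rationale alongside the label is the design choice of interest here: it lets human reviewers audit why content was flagged instead of seeing only a binary decision.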

Oct 29, 2025 · 12:13 PM UTC

Replying to @OpenAI
The blue cookbook cover with its clear guide on GPT safeguards feels like a helpful starting point for builders. 🔒
Replying to @OpenAI
People are going to learn a ton from how these models reason and handle real-world safety tradeoffs. The cookbook sounds like a must-read.
Replying to @OpenAI
Open-weight safety models are crucial for transparency. Developers can now audit and customize safety mechanisms rather than relying on black-box solutions.
Replying to @OpenAI
Congrats on the collaboration with ROOST! How do you see the balance between policy prompt optimization and safeguarding model integrity evolving as AI systems become more complex? Would love to hear more about the challenges and innovations.