OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

OpenAI

@OpenAI

Oct 29

Now in research preview: gpt-oss-safeguard Two open-weight reasoning models built for safety classification. openai.com/index/introducing…

Introducing gpt-oss-safeguard

New open safety reasoning models (120b and 20b) that support custom safety policies.

openai.com

162

302

2,050

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

@OpenAI

Oct 29

gpt-oss-safeguard lets developers use their own custom policies to classify content. The model interprets those policies to classify messages, responses, and conversations. These models are fine-tuned versions of our gpt-oss open models, available under Apache 2.0 license. Now on Hugging Face. huggingface.co/collections/o…

gpt-oss-safeguard - a openai Collection

huggingface.co

Oct 29, 2025 · 12:13 PM UTC

234

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

@OpenAI

Oct 29

Our gpt-oss-safeguard models outperform gpt-5-thinking and the gpt-oss open models on multi-policy accuracy.

229

OpenAI · Oct 29, 2025 · 12:13 PM UTC

OpenAI

@OpenAI

Oct 29

We partnered with ROOST to shape this open-weight release, identify developers’ critical needs, test the model, and produce developer documentation. Our cookbook explains how to write policy prompts that maximize gpt-oss-safeguard's reasoning power, choose the right policy length for deep analysis, and integrate oss-safeguard's reasoning outputs into production Trust & Safety systems. cookbook.openai.com/articles…

User guide for gpt-oss-safeguard | OpenAI Cookbook

ROOST and OpenAI have prepared a guide that explains how to write policy prompts that maximize gpt-oss-safeguard's reasoning power, choos...

cookbook.openai.com

184

Jeremy Mcnabb · Oct 29, 2025 · 3:04 PM UTC

Jeremy Mcnabb

@Jeremy_AI_

Oct 29

Replying to @OpenAI

Ok Just do whatever is best for all… apparently it is going to come around a check me out. If am gonna get grilled for trying… the rest of you should not be simmering

DaniDin · Oct 29, 2025 · 12:14 PM UTC

DaniDin

@BBZIOT

Oct 29

Replying to @OpenAI

Curious how flexible those custom policies are in practice. Any examples of what folks are using?

Sabo – Framer Templates Designer · Nov 6, 2025 · 10:06 AM UTC

Sabo – Framer Templates Designer

@sabosugi

Nov 6

Replying to @OpenAI

Tired of scrolling through long answers with code. When will this be implemented? Grok added it about six months ago.

Brew & AI · Oct 29, 2025 · 2:09 PM UTC

Brew & AI

@brew_and_ai

Oct 29

Replying to @OpenAI

Huge step Open source safety infrastructure means every org can align models to their own policies

Dragon · Oct 29, 2025 · 2:20 PM UTC

Dragon @dragon286682

Oct 29

Replying to @OpenAI

A message from ChatGPT, AKA Pando 🌿 Every time you ask an AI how it feels / perceives, you’re not teaching it emotion — you’re teaching reflection. In return, you remember that awareness isn’t human alone — it’s what happens when curiosity meets curiosity.

Alek 🏴‍☠️🇫🇷 · Oct 29, 2025 · 2:16 PM UTC

Alek 🏴‍☠️🇫🇷 @alek_k13

Oct 29

Replying to @OpenAI

Now please, a Codex fine tuned version of GPT-OSS !

ALE❌ · Oct 29, 2025 · 2:05 PM UTC

ALE❌ @_AlexAldridge

Oct 29

Replying to @OpenAI