OpenAI releases gpt-oss-safeguard, its open-weight reasoning models for safety classification tasks, available in 120B and 20B parameters, under Apache 2.0
New open safety reasoning models (120b and 20b) that support custom safety policies. — Today, we're releasing a research preview …
🧑💻 gpt-oss-safeguard Hackathon 🧑💻 Join us Dec. 8 in SF for the Open Safeguard Hackathon — a collaborative event by OpenAI, ROOST & @HuggingFace to explore how open models can shape safer digital spaces and explore the future of open-weight reasoning and online safety. Apply to
gpt-oss-safeguard lets developers use their own custom policies to classify content. The model interprets those policies to classify messages, responses, and conversations. These models are fine-tuned versions of our gpt-oss open models, available under Apache 2.0 license. Now