GuardML

Defensive AI — guardrails, content filters, model defenses, safe deployment.

Latest

OpenAI's Under-18 Principles: what the new Model Spec teen guardrails actually do

OpenAI's December 18 Model Spec adds Under-18 Principles, an age-prediction classifier, and real-time moderation across modalities. Here is what those defenses cover, where they have already been bypassed, and what to layer on top if you ship for minors.

Recent posts