GuardML
Defensive AI — guardrails, content filters, model defenses, safe deployment.
OpenAI's Under-18 Principles: what the new Model Spec teen guardrails actually do
OpenAI's December 18 Model Spec adds Under-18 Principles, an age-prediction classifier, and real-time moderation across modalities. Here is what those defenses cover, where they have already been bypassed, and what to layer on top if you ship for minors.