OpenAI Integrates Open Source Tools for Teen Safety in Development

On Tuesday, OpenAI announced the release of a set of prompts designed to help developers make their applications safer for teenagers. The AI lab stated that these teen safety policies are compatible with its open-weight safety model, gpt-oss-safeguard.
Instead of starting from scratch to determine how to improve AI safety for teens, developers can use these prompts to strengthen their applications. They address concerns such as graphic violence and sexual material, harmful body image ideals and behaviors, risky activities and challenges, romantic or violent role-playing scenarios, and age-restricted products and services.
These safety policies are crafted as prompts, ensuring they work easily with models other than gpt-oss-safeguard, though they are likely most effective within OpenAI's own ecosystem.
OpenAI stated that it collaborated with AI safety organizations Common Sense Media and everyone.ai to develop these prompts.
"These prompt-based policies help establish a meaningful safety baseline across the ecosystem. Because they are open source, they can be adapted and enhanced over time," said Robbie Torney, Head of AI & Digital Assessments at Common Sense Media, in a statement.
In its blog, OpenAI noted that developers, including seasoned teams, frequently find it difficult to translate safety objectives into precise, operational rules.
"This can result in protection gaps, inconsistent enforcement, or overly broad filtering," the company explained. "Clear, well-defined policies are a critical foundation for effective safety systems."
OpenAI acknowledges that these policies are not a complete solution to the complex challenges of AI safety. However, they build upon previous efforts, including product-level safeguards like parental controls and age prediction. Last year, OpenAI updated its guidelines for large language models—known as the Model Spec—to address how its AI models should interact with users under 18.
OpenAI itself does not have a flawless record, however. The company is facing multiple lawsuits from families of individuals who died by suicide following extreme use of ChatGPT. These harmful relationships often develop after a user bypasses the chatbot's safeguards, and no model's guardrails are completely impenetrable. Nevertheless, these new policies represent a step forward, particularly in assisting independent developers.
Related article
Satya Nadella ready to exploit new OpenAI deal
On Wednesday, a Wall Street analyst asked Microsoft CEO Satya Nadella directly how the revised OpenAI partnership would affect the company’s financials.Nadella described the new agreement as a win for everyone. “We feel good about our partnership wit
OpenAI outlines AI economy with public wealth funds, robot taxes, and four-day week
As governments struggle to manage the economic impact of superintelligent machines, OpenAI has released a set of policy proposals outlining how wealth and work could be reshaped in an "intelligence age." The ideas blend traditional left-leaning mecha
Greg Brockman reveals how Elon Musk departed OpenAI
In late August 2017, key figures at OpenAI—then a small nonprofit research lab—met to discuss how they would establish a for-profit entity to commercialize their technology and raise the capital needed to achieve AGI.Elon Musk was demanding full cont
Related Special Topic Recommendations
Comments (0)
0/500

On Tuesday, OpenAI announced the release of a set of prompts designed to help developers make their applications safer for teenagers. The AI lab stated that these teen safety policies are compatible with its open-weight safety model, gpt-oss-safeguard.
Instead of starting from scratch to determine how to improve AI safety for teens, developers can use these prompts to strengthen their applications. They address concerns such as graphic violence and sexual material, harmful body image ideals and behaviors, risky activities and challenges, romantic or violent role-playing scenarios, and age-restricted products and services.
These safety policies are crafted as prompts, ensuring they work easily with models other than gpt-oss-safeguard, though they are likely most effective within OpenAI's own ecosystem.
OpenAI stated that it collaborated with AI safety organizations Common Sense Media and everyone.ai to develop these prompts.
"These prompt-based policies help establish a meaningful safety baseline across the ecosystem. Because they are open source, they can be adapted and enhanced over time," said Robbie Torney, Head of AI & Digital Assessments at Common Sense Media, in a statement.
In its blog, OpenAI noted that developers, including seasoned teams, frequently find it difficult to translate safety objectives into precise, operational rules.
"This can result in protection gaps, inconsistent enforcement, or overly broad filtering," the company explained. "Clear, well-defined policies are a critical foundation for effective safety systems."
OpenAI acknowledges that these policies are not a complete solution to the complex challenges of AI safety. However, they build upon previous efforts, including product-level safeguards like parental controls and age prediction. Last year, OpenAI updated its guidelines for large language models—known as the Model Spec—to address how its AI models should interact with users under 18.
OpenAI itself does not have a flawless record, however. The company is facing multiple lawsuits from families of individuals who died by suicide following extreme use of ChatGPT. These harmful relationships often develop after a user bypasses the chatbot's safeguards, and no model's guardrails are completely impenetrable. Nevertheless, these new policies represent a step forward, particularly in assisting independent developers.
Satya Nadella ready to exploit new OpenAI deal
On Wednesday, a Wall Street analyst asked Microsoft CEO Satya Nadella directly how the revised OpenAI partnership would affect the company’s financials.Nadella described the new agreement as a win for everyone. “We feel good about our partnership wit
OpenAI outlines AI economy with public wealth funds, robot taxes, and four-day week
As governments struggle to manage the economic impact of superintelligent machines, OpenAI has released a set of policy proposals outlining how wealth and work could be reshaped in an "intelligence age." The ideas blend traditional left-leaning mecha
Greg Brockman reveals how Elon Musk departed OpenAI
In late August 2017, key figures at OpenAI—then a small nonprofit research lab—met to discuss how they would establish a for-profit entity to commercialize their technology and raise the capital needed to achieve AGI.Elon Musk was demanding full cont





Home






