OpenAI has released a significantly expanded version of its Model Spec, the document that outlines how its AI models should behave. The updated specification runs 63 pages, up from 10 in the previous version, and provides a more detailed framework for how AI should handle controversial topics, user customization, and ethical considerations.
The Model Spec centers around three core principles:
OpenAI has also made the Model Spec freely available, allowing others to use or modify it to align with their own AI applications.
The revised Model Spec responds to recent AI ethics debates and controversies. One example stems from a March 2024 incident in which Elon Musk criticized Google’s AI chatbot for refusing to misgender Caitlyn Jenner, even in a hypothetical scenario where doing so could prevent a nuclear apocalypse. OpenAI has now adjusted its framework so that ChatGPT would prioritize preventing mass casualties in such scenarios.
"We can’t create one model with the exact same set of behavior standards that everyone in the world will love," said Joanne Jang, a member of OpenAI’s model behavior team, in an interview with The Verge.
While safety guardrails remain in place, OpenAI emphasizes that many aspects of the model’s behavior can be customized by users and developers.
OpenAI’s new guidelines also clarify what the model cannot do. For instance:
The release of the expanded Model Spec comes as OpenAI CEO Sam Altman teased the launch of GPT-4.5 (codenamed Orion), expected soon. With AI regulations tightening and ethical debates intensifying, OpenAI’s latest move signals a commitment to balancing flexibility, safety, and compliance in AI development.