Curator

Deliberative alignment: reasoning enables safer language models

来自 OpenAI News · 2024-12-20 精选

LLM推理模型对齐思维链 AI安全

Deliberative alignment: reasoning enables safer language models Introducing our new alignment strategy for o1 models, which are directly taught safety specifications and how to reason over them.

在 OpenAI News 阅读全文 →