Improving instruction hierarchy in frontier LLMs
来自 OpenAI News
· 2026-03-10
IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.