|src: openai.com

Improving instruction hierarchy in frontier LLMs

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.