<cd ../feed
improving-model-safety-behavior-with-rule-based-rewards.log
|src: openai.com

Improving Model Safety Behavior with Rule-Based Rewards

We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.