|src: openai.com

Learning to summarize with human feedback

We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.