Learning to summarize with human feedback
From OpenAI News · 2020-09-04
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.