Fine-tuning GPT-2 from human preferences

From OpenAI News · 2019-09-19

We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not always match our own. Specifically, for summarization tasks the labelers preferred sentences copie...

Read the full article on OpenAI News →