Gathering human feedback

From OpenAI News · 2017-08-03

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard ...

Read the full article on OpenAI News →