Curator

How AI training scales

来自 OpenAI News · 2018-12-14 精选

We’ve discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks. Since complex tasks tend to have noisier gradients, increasingly large batch sizes are likely to become useful in the future, removing on...

在 OpenAI News 阅读全文 →