Curator

Language models can explain neurons in language models

来自 OpenAI News · 2023-05-09 精选

LLM推理模型评测 Transformer AI安全

We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2.

在 OpenAI News 阅读全文 →