Language models can explain neurons in language models

Posted by admin on May 10, 2023

We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2.
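
As a rough illustration of what "scoring" an explanation can mean, the sketch below assumes a score based on how well activations predicted from an explanation correlate with the neuron's real activations. The function name `correlation_score` and the toy activation values are hypothetical and are not taken from the released dataset or code.

```python
import math


def correlation_score(simulated, actual):
    """Pearson correlation between simulated and real per-token activations.

    Assumption: a higher correlation means the explanation predicts the
    neuron's behavior better. Returns 0.0 if either series is constant.
    """
    if len(simulated) != len(actual) or len(simulated) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")

    n = len(simulated)
    mean_s = sum(simulated) / n
    mean_a = sum(actual) / n

    cov = sum((s - mean_s) * (a - mean_a) for s, a in zip(simulated, actual))
    var_s = sum((s - mean_s) ** 2 for s in simulated)
    var_a = sum((a - mean_a) ** 2 for a in actual)

    if var_s == 0 or var_a == 0:
        return 0.0  # correlation is undefined for a constant series
    return cov / math.sqrt(var_s * var_a)


# Toy example: activations an explanation-based simulator might predict,
# next to made-up "real" activations for the same tokens.
simulated = [0.0, 1.0, 2.0, 0.5]
actual = [0.1, 0.9, 2.2, 0.4]
score = correlation_score(simulated, actual)
print(f"explanation score: {score:.3f}")
```

A score near 1 would suggest the explanation tracks the neuron closely, while a score near 0 would suggest it captures little of the neuron's behavior.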

Source: https://openai.com/research/language-models-can-explain-neurons-in-language-models

