Repo for investigating and editing the activations of language models (up to Llama 70B)
I am adapting parts of https://github.com/ericwtodd/function_vectors for this repo!
Repo for investigating and editing the activations of language models (up to Llama 70B)
I am adapting parts of https://github.com/ericwtodd/function_vectors for this repo!