We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Python 7.1k 1k
A framework for few-shot evaluation of language models.
Python 7.9k 2.1k
Forked from luanti-org/luanti
Minetest is an open source voxel game engine with easy modding and game creation
C++ 64 10
The hub for EleutherAI's work on interpretability and learning dynamics
Jupyter Notebook 2.4k 179
Sparsify transformers with SAEs and transcoders
Keeping language models honest by directly eliciting knowledge encoded in their activations.
Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
A library for mechanistic anomaly detection
Closed-form polynomial approximations to neural networks