Mech Interp

Work I've been doing to help me understand and contribute to the field of mechanistic interpretability.

Awesome resources that I've pulled from:

Interpretability with Sparse Autoencoders (Colab exercises) by Callum McDougall