Most ML folks I know have @AnthropicAI's Toy Models of Superposition paper on their reading list, but too few have read it.
— Emmanuel Ameisen (@mlpowered) October 19, 2022
It is one of the most interesting interpretability papers I've read in a while and it can benefit anyone using deep learning.
Here are my takeaways! pic.twitter.com/XrQ3Pp6b6b