Skip to main content

Interpretability

Language Models Implement Simple Word2Vec-style
Vector Arithmetic

https://arxiv.org/pdf/2305.16130.pdf