Christopher Olah

Pioneer of interpretability research at Anthropic. His essays on neural network internals (circuits, features, mechanistic interpretability) are foundational reading.

Topics (3)

Foundational AI

Sequence Models & Attention