I’ve been working with Claude to craft some tutorials on interpretability. One channel we are working on is an analogical infrastructure that permits the exploration
I’ve been working with Claude to craft some tutorials on interpretability. One channel we are working on is an analogical infrastructure that permits the exploration
When we talk with one another we are simultaneously aware that we are in the world together and that we have to work very hard
Suppose you have a transformer that handles tokens of dimension Dmodel.D_{model}. And suppose it has ten tokens in the input. The residual stream that’s updated
I just re-read Max Weber’s essay “Science as a Vocation” in the context of thinking about human and machine intelligence alignment. In particular, I’m thinking
I’m working on a course in which we will examine in parallel solutions to problems of human intelligence alignment (how do we get people to
This is the first in a series of essays in which I try to think through the question of what assumptions social institutions make about