Introducing Activation Atlases
Weβve createdΒ activation atlasesΒ (inΒ collaborationΒ with Google researchers), a new technique for visualizing what interactions between neurons can represent. As AI systems are deployed in increasingly sensitive contexts, having a better understanding of their internal decision-making processes will...
Log in to bookmark articles and create collections