
neuronpedia
@neuronpedia
Followers: 884
Following: 28
Media: 23
Statuses: 58
open source interpretability platform 🧠🧐
the residual stream
Joined July 2023
Today, we're releasing The Circuit Analysis Research Landscape: an interpretability post extending and open-sourcing Anthropic's circuit tracing work, co-authored by @Anthropic, @GoogleDeepMind, @GoodfireAI, @AiEleuther, and @decode_research. Here's a quick demo, details follow: ⤵️
RT @thebasepoint: Very cool collaboration between 5 labs that dug into circuit tracing after our paper in March. Sections on replications,…
RT @GoodfireAI: New research with coauthors at @Anthropic, @GoogleDeepMind, @AiEleuther, and @decode_research! We expand on and open-source…
Want to learn more? Watch the two-part "Attribution Graphs for Dummies", where Anthropic model biology researchers @Jack_W_Lindsey @mlpowered walk @NeelNanda5, @banburismus_ and you through a guided tutorial of circuit tracing.
Try it yourself! Make your own attribution graphs to visualize a model's internal reasoning on any custom text prompt, for Gemma 2 and Qwen3, at neuronpedia.org
Anthropic's model biology paper made a big splash in March. In this post, five interpretability orgs discuss new extensions, replications, and progress including: more efficient training, open problems, and research perspectives. Read the post here ➡️
neuronpedia.org
A multi-organization interpretability project to replicate and extend circuit tracing research.
Blog post: Neuronpedia's First Anthropic Collaboration (neuronpedia.org)
Reminder: The Residual Stream newsletter (~1/month) is sent to everyone with a free Neuronpedia account.
RT @a_karvonen: New Paper! Robustly Improving LLM Fairness in Realistic Settings via Interpretability. We show that adding realistic detail…
RT @swyx: I think this is the podcast that finally interp-pilled me. We snuck in a little intro featuring @johnnylin's @neuronpedia and ask…
RT @NeelNanda5: Fantastic to see Anthropic, in collaboration with @neuronpedia, creating open source tools for studying circuits with trans…
RT @michaelwhanna: @mntssys and I are excited to announce circuit-tracer, a library that makes circuit-finding simple! Just type in a sent…
RT @AnthropicAI: Researchers can use the Neuronpedia interactive interface here: And we’ve provided an annotated w…
github.com
Contribute to safety-research/circuit-tracer development by creating an account on GitHub.