Laura Kopf @lkopf_ml X Profile

Laura Kopf

@lkopf_ml

Followers

63

Following

37

Media

14

Statuses

32

PhD student in Interpretable Machine Learning @bifoldberlin @TUBerlin

Berlin

Joined May 2024

Don't wanna be here? Send us removal request.

Laura Kopf

@lkopf_ml

21 days

🔍 When do neurons encode multiple concepts?. We introduce PRISM, a framework for extracting multi-concept feature descriptions to better understand polysemanticity. 📄 Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework.🧵

1

4

13

Laura Kopf

@lkopf_ml

21 days

Grateful to the institutions that supported this work:.@TUBerlin.@bifoldberlin.@UMI_Lab_AI.@FraunhoferHHI.@unipotsdam.@LeibnizATB. (7/7).

0

2

Laura Kopf

@lkopf_ml

21 days

Many thanks to my amazing co-authors:.@nfelnlp.@kirill_bykov.@BommerPhiline.@anna_hedstroem.@Marina_MCV.@EberleOliver. (6/7).

1

0

2

Laura Kopf

@lkopf_ml

21 days

Our results highlight that the PRISM framework not only provides multiple human interpretable descriptions for neurons but also aligns with the human interpretation of polysemanticity. (5/7)

1

0

2

Laura Kopf

@lkopf_ml

21 days

In exploring the concept space, we use PRISM to characterize more complex components, finding and interpreting patterns that specific attention heads or groups of neurons respond to. (4/7)

1

0

3

Laura Kopf

@lkopf_ml

21 days

We benchmark PRISM across layers and architectures, showing how polysemanticity and interpretability shift through the model. (3/7)

1

0

2

Laura Kopf

@lkopf_ml

21 days

PRISM samples sentences from the top percentile activation distribution, clusters them in embedding space, and uses an LLM to generate labels for each concept cluster. (2/7)

1

0

3

Laura Kopf

@lkopf_ml

7 months

Huge thanks to my incredible supervisor @kirill_bykov, who laid the foundation for this project and provided brilliant guidance 🙏, and to @BommerPhiline and @SLapuschkin, who unfortunately couldn’t be there.

0

2

Laura Kopf

@lkopf_ml

7 months

Still overwhelmed by the amazing response to our poster session at @NeurIPSConf with @anna_hedstroem and @Marina_MCV! It was incredible to have such lively and inspiring discussions with brilliant people whose work I admire. ✨

2

3

6

Laura Kopf

@lkopf_ml

7 months

#NeurIPS2024 #MechInterp #Interpretability #ExplainableAI.

0

2

Laura Kopf

@lkopf_ml

7 months

Want to know more about CoSy?.📄 Paper: 💻 Code: 🔗 Poster:

1

0

Laura Kopf

@lkopf_ml

7 months

Special thanks to our supporting institutions:.@UMI_Lab_AI, @bifoldberlin, @ml_tuberlin, @TUBerlin, @unipotsdam, @LeibnizATB, and @FraunhoferHHI.

1

0

2

Laura Kopf

@lkopf_ml

7 months

My co-authors @anna_hedstroem and @Marina_MCV will also be at @NeurIPSConf. A big thank you to my other co-authors @kirill_bykov, @BommerPhiline, and @SLapuschkin, who unfortunately couldn’t be there.

1

0

1

Laura Kopf

@lkopf_ml

7 months

I’ll be presenting our work at @NeurIPSConf in Vancouver! 🎉.Join me this Thursday, December 12th, in East Exhibit Hall A-C, Poster #3107, from 11 a.m. PST to 2 p.m. PST. I'll be discussing our paper “CoSy: Evaluating Textual Explanations of Neurons.”

1

4

16

Laura Kopf

@lkopf_ml

7 months

RT @kirill_bykov: I am not attending #NeurIPS2024, but I encourage everyone interested in #XAI and #MechInterp to check out our paper on ev….

0

3

0

Laura Kopf

@lkopf_ml

7 months

RT @NeelNanda5: NeurIPS has an overwhelming amount of papers, so I made myself a hacky spreadsheet of all (well, most) of the interpretabil….

0

51

0

Laura Kopf

@lkopf_ml

10 months

Special thanks to our supporting institutions: @UMI_Lab_AI.@unipotsdam.@LeibnizATB.@bifoldberlin.@FraunhoferHHI.@TUBerlin . #InterpretableML #MechInterp #ExplainableAI.

0

3

Laura Kopf

@lkopf_ml

10 months

🎉 Excited to announce that our paper has been accepted to #NeurIPS2024! This is my first first-author publication 🥳 I'm incredibly grateful to my amazing supervisor @kirill_bykov and co-authors @BommerPhiline @anna_hedstroem @SLapuschkin @Marina_MCV!. 📄

1

3

20

Laura Kopf

@lkopf_ml

1 year

Join us today at the #ICML2024 Workshop on the Next Generation of AI Safety! Find @kirill_bykov and me in Hall A1 at Poster Session #2, from 3:30 PM to 4:30 PM. Looking forward to seeing you there!

0

3

13

Laura Kopf

@lkopf_ml

1 year

RT @UMI_Lab_AI: Join us at the @icmlconf in Vienna next week. We are presenting two of our papers at the Mechanistic Interpretability and N….

0

1

0