Michael Byun @m_j_byun X Profile

Michael Byun

@m_j_byun

Followers

75

Following

126

Media

1

Statuses

5

AI interpretability & policy @ Goodfire & RAND

Joined February 2015

Don't wanna be here? Send us removal request.

Goodfire

@GoodfireAI

15 days

Adversarial examples - a vulnerability of every AI model, and a “mystery” of deep learning - may simply come from models cramming many features into the same neurons! Less feature interference → more robust models. New research from @livgorton 🧵 (1/4)

4

24

244

Kevin Wei

@kevinlwei

2 months

🚨 New paper alert! 🚨 Are human baselines rigorous enough to support claims about "superhuman" performance? Spoiler alert: often not! @prpaskov and I will be presenting our spotlight paper at ICML next week on the state of human baselines + how to improve them!

1

8

20