~2yrs ago @nsaphra came to my poster & we discussed regularizing to ctrl interpretability. She mentioned a superstar grad student (@_angie_chen). Things really got wild when @ziv_ravid joined the party. And @kchonyc graced us w/ wisdom throughout. V excited to finally announce:
New work w/ @ziv_ravid @kchonyc @leavittron @nsaphra: We break the steepest MLM loss drop into *2* phase changes: first in internal grammatical structure, then external capabilities. Big implications for emergence, simplicity bias, and interpretability! 🧵