@leavittron
Matthew Leavitt
9 months
~2yrs ago @nsaphra came to my poster & we discussed regularizing to ctrl interpretability. She mentioned a superstar grad student ( @_angie_chen ). Things really got wild when @ziv_ravid joined the party. And @kchonyc graced us w/ wisdom throughout. V excited to finally announce:
@_angie_chen
Angelica Chen
9 months
New work w/ @ziv_ravid @kchonyc @leavittron @nsaphra : We break the steepest MLM loss drop into *2* phase changes: first in internal grammatical structure, then external capabilities. Big implications for emergence, simplicity bias, and interpretability! 🧵

Replies

@kchonyc
Kyunghyun Cho
9 months
@leavittron @nsaphra @_angie_chen @ziv_ravid though, my wisest wisdom wasn't accepted 😂