
Berfin Simsek
@bsimsek13
Followers
769
Following
984
Media
9
Statuses
178
Research fellow @FlatironCCM & @NYU, previously: Ph.D. @EPFL, intern @MetaAI. DL Theory 😎, Math 🥰, AI 🤗 Slowly migrating to @bsimsek.bsky.social
New York, USA
Joined December 2017
Come see our analysis of a Gaussian multi-index model #AISTATS2025 on Sunday at Hall A—E 183. My favorite result is when the dot product between the ideal vectors exceeds a threshold, gradient flow fails to separate them under correlation loss! 😎.
1
0
12
Very exciting research direction ! 🙌🏼.
NY times article on expMath, my AI for math @darpa program, with commentary from mathematicians Andrew Granville, Bryna Kra, Jordan Ellenberg, and context from @the_IAS professor @alondra and @AnthropicAI CEO @DarioAmodei.
0
0
5
This was done in collaboration with Amire Bendjeddou & Daniel Hsu. 🙌 Paper link:
arxiv.org
This work focuses on the gradient flow dynamics of a neural network model that uses correlation loss to approximate a multi-index function on high-dimensional standard Gaussian data. Specifically,...
1
0
1
Here is a link to my talk on distillation for neural networks. at Les Houches together with many other talks on algorithmic theories of learning 🙌 Thanks to organizers @_brloureiro and Vittorio.
videos.univ-grenoble-alpes.fr
Découverte de l'université, des campus, de l'organisation et de la stratégie universitaire.
1
0
3
I don’t mind if o1 does not think clearly like humans. It’s great for computing formulas like integrals, even better in combination with Wolfram alpha 🙌🏼.
o1 may be superhuman in some respects, but it's ability to think clearly mathematically about integration is still not equal to a strong high schooler.
1
0
4
RT @jacobandreas: Ekin Akyürek (@akyurekekin) builds tools for understanding & controlling algorithms that underlie reasoning in language m….
0
7
0
4/n Here is a blog post: You may enjoy this exposition if you like Toy Models of Superposition.
bsimsek.com
It is important to understand how large models represent knowledge to make them efficient and safe. We study a toy model of neural nets that exhibits non-linear dynamics and phase transition....
1
1
1
RT @deepcohen: The Center for Computational Mathematics at Flatiron Institute is hiring research fellows (postdocs) to start next year -- a….
0
12
0