
Andrey Gromov
@Andr3yGR
Followers
232
Following
750
Media
11
Statuses
56
Meta FAIR Research Scientist & physics professor at University of Maryland, College Park
Bay Area
Joined June 2009
Excited to be a part of this!.
Our new Simons Collaboration on the Physics of Learning and Neural Computation will employ and develop powerful tools from #physics, #math, computer science and theoretical #neuroscience to understand how large neural networks learn, compute, scale, reason and imagine:.
0
4
20
There are more experiments and visualizations in the paper Routing and conditional computing should be taken more seriously. 10/.
arxiv.org
We introduce and train distributed neural architectures (DNA) in vision and language domains. DNAs are initialized with a proto-architecture that consists of (transformer, MLP, attention, etc.)...
2
0
13
RT @BorisHanin: What an incredible lineup of panelists and researchers!. Super excited to attend this.
0
3
0
RT @tydsh: Our new work Spectral Journey shows a surprising finding: when a 2-layer Transformer is learned to predi….
arxiv.org
Decoder-only transformers lead to a step-change in capability of large language models. However, opinions are mixed as to whether they are really planning or reasoning. A path to making progress...
0
88
0
RT @darshilhdoshi1: Interested in mechanistic interpretability of how Transformers learn in-context via skill composition? Come to our #Neu….
0
1
0
RT @MBarkeshli: John Hopfield has a nice article in the annual reviews of condensed matter physics. It starts off with a discussion of what….
0
167
0
RT @MBarkeshli: The Nobel Committee recognizes profound contributions from Physics to ML / AI. There's a lot more where that came from. We….
0
3
0