
Puneesh Deora
@puneeshdeora
Followers
109
Following
4K
Media
131
Statuses
678
PhD student at UBC. Working on the foundations of LLMs and theory of DL. Loves memes :)
Joined August 2019
When people at Meta see someone walk in knowing calculus, linear algebra, and probability theory
See below on what Zuckerberg is looking for in star recruits worth $100m pay packages for Meta’s plans in Artificial Intelligence. But weren’t some people saying calculus is no longer useful in the AI age? 🤔
0
0
3
We also probe:. • model size 🏋️♂️.• skewed training mixtures ⚖️.• context length 📏.• LSTMs. For more details check out our paper:.🔗📜. Work done with amazing collaborators: @bhavya_vasudeva , Tina Behnia, Christos Thrampoulidis.
2
1
10
Example #2 👉 linear regression with d/2 vs. d-dimensional regressors. Result: For d/2-dim class data, even when both d and d/2-least squares (LS) solutions fit the context, the model uses d/2-LS solution. For d-dim class data, it uses d-LS solution.
1
0
6
We train transformers on tasks from hierarchical complexity categories—simple ✖️ complex. Example #1 👉 order-1 vs. order-3 Markov chains. Result: The model identifies the order and switches between bigram and tetragram stats on the fly.
1
1
8
If you don't submit reviews on time, you lose access to your own reviews. I like this :). Be careful while choosing your co-authors :P
Responsible reviewing initiatives for NeurIPS 2025 - read more about changes to reviewing that that will safeguard reviewing quality and timeline in our blog post below: .
1
0
2
I will be presenting our recent work on In-context Learning with multiple task groups at the SCSL workshop tomorrow (@SCSLWorkshop) at #ICLR2025. Swing by and say hi! 😄
0
4
16
I presented our TMLR work on Optimization and Generation of Multi-head Attention at #ICLR2025 today. Please use your time machines to attend 😛 . Thanks to people who stopped/will stop by.
0
2
20