
David Marx (@digthatdata.bsky.social)
@DigThatData
Followers
4K
Following
11K
Media
1K
Statuses
10K
Generative AI MLE, FOSS toolmaker, innovation catalyst @CoreWeave + @AiEleuther. https://t.co/z0fpuhlWRs
Seattle, WA
Joined November 2013
RT @atroyn: 'we're in this bizarre world where the best way to learn about llms. is to read papers by chinese companies. i do not think t….
0
27
0
RT @hhwpku: 🔥 Are you ever dissatisfied with the imprecise names in vision-language datasets?. 🚀 At #NeurIPS2024, we introduce 𝐑𝐄𝐍𝐎𝐕𝐀𝐓𝐄, sh….
0
6
0
Yo this paper is wild.
🌟Announcing NeurIPS spotlight paper on the transition from lazy to rich🔦 . We reveal through exact gradient flow dynamics how unbalanced initializations promote rapid feature learning. co-led @AllanRaventos. and @ClementineDomi6 @FCHEN_AI @klindt_david @SaxeLab @SuryaGanguli
0
1
1
RT @hi_tysam: New NanoGPT training speed record: 3.28 FineWeb val loss in 4.66 minutes. Previous record: 5.03 minutes.Changelog: .- FlexAtt….
0
22
0
RT @kellerjordan0: This is officially the new record! Congrats @hi_tysam (who is also an OG of CIFAR-10 speedrunning)..
0
11
0
RT @canondetortugas: Is KL-regularization the right tool for language model alignment? . The χPO algorithm: We show that a one-line change….
0
25
0
RT @jeremyphoward: Narrative on X: 🦋 has no AI/ML and just talks about itself.My actual feed on 🦋:
0
66
0
RT @jeremyphoward: You can choose what feeds you have on your homepage -- I personally like the "popular with friends" feed (in the above t….
0
5
0
RT @jeremyphoward: Because X has tended to censor discussion of social networks I won't link directly, but look for this post to get an ins….
0
15
0
RT @laion_ai: We announce LAION-DISCO-12M - a collection of 12 million links to publicly available YouTube samples paired with metadata to….
0
43
0