bagel.com
@bageldotcom
Followers
12K
Following
2K
Media
126
Statuses
640
open source superintelligence
Joined June 2023
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
140
163
970
NeurIPS takeways (better late than never) 1. real AGI needs real continual learning - models that can keep learning without catastrophic forgetting. 2. model architectures need to be "stateful" for building accurate world models for games and robotics. 3. diffusion models are
5
7
39
Congrats on Paris to @bageldotcom and @bidhan! Open and Decentralized diffusion model shared at @NeurIPSConf
12
3
27
I’m going to give a talk at NeurIPS on decentralized diffusion models later today, come by if you’re around!
4
6
43
Noticing this release now (h/t @yacinelearning) This decentralizing training approach is so cool! I will note I shared this alpha back when Luma released its paper in Jan If you're not following me you're going to be missing alpha 😄
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
3
10
65
btw folks I am about to dive headfirst into the wondrous world of open-weight diffusion this weekend wish me luck
Paris does something that shouldn't work. It's a combination of smaller expert diffusion models pre-trained from scratch, across different continents in complete isolation. Absolutely zero synchronization among each other during training. This zero communication protocol
11
6
125
BREAKING 🚨 Bagel Labs launch Paris - world's first decentralized trained open-weight diffusion model open for research and commercial use. - comparable quality to SOTA using 14× less data and 16× less compute. - combination of smaller expert diffusion models pre-trained from
The results. These images came from 8 experts that never spoke to each other during training. We believe if we can scale this approach, this is the first real step towards open source superintelligence. But that requires solving some more really really hard problems. If you're
2
7
38
oh boy my ai generated girlfriends are going to look even more realistic now
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
2
4
39
community sourced, decentralized compute is very necessary for shared information development propagation in a non-data-leaky manner! something like this coupled with diloco should be next!
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
0
3
9
A nice surprise! Fully open source reproduction of Decentralized Diffusion Models. Congrats to the team!
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
3
4
16
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
140
163
970
You should be supporting every company that is fighting the fight for open source agi Very excited for this launch
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
3
11
120
Excited to share what I’ve been working on for the past two months - decentralized diffusion models pre-trained entirely in isolation. They outperform monolithic training under the same conditions and reach comparable FID to the DDM paper using 14x less data and 16x less compute!
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
1
4
9
Revolutions need first principle thinking. that's what we did with Paris. instead of incremental improvements, we build an entirely new distributed learning stack from scratch that removes the communication bottleneck entirely. This is the spaceX moment for decentralized AI.
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
8
6
31
Paris looks like the first fully decentralized trained diffusion model with open weights!! Eight experts learned in isolation with zero sync, then a tiny DiT router picks the best pair at inference. Big win for open source and elastic scale!
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
3
7
30