
Gradient
@Gradient_HQ
Followers
824K
Following
245
Media
55
Statuses
202
Building the world’s first fully distributed AI runtime on @solana. Try Parallax: https://t.co/V0Ft85cndH
Joined May 2024
Intelligence has been locked in walled gardens. Today, we’re opening the gates. Parallax now runs in Hybrid mode, with Macs and GPUs serving large models together in a truly distributed framework.
154
189
951
RT @EricY_me: Had a blast speaking at @BerkeleyRDI and @StanfordSBA about our work on decentralized AI. Grateful for the brilliant minds….
0
13
0
RT @EricY_me: People always ask me, what’s the point of having consumer-level GPUs host those models, or even doing decentralized AI genera….
0
20
0
We’ll close out SBC week with a mixer alongside @StanfordCrypto & @openmind_agi. An evening of good vibes with fellow builders, researchers, and investors shaping the decentralized future. Join here:.
lu.ma
Come around and hang with us for a fun evening along with fellow builders and investors attending SBC25. About the Hosts Stanford Blockchain Club is…
14
38
161
Earlier on 3 Aug, Eric will also join a morning panel on Blockchain × AI at BASS SBC 2025 by @StanfordSBA, diving into how decentralized infra reshapes intelligent applications. Register here:.
lu.ma
We are excited to share the upcoming Blockchain Application Stanford Summit on August 3rd at Berkeley during SBC. This event is made possible by the generous…
10
25
160
Big week ahead at SBC 25. Our cofounder @EricY_me will deliver a keynote at the Summit on Decentralization and AI 2025, hosted by @BerkeleyRDI, sharing how Gradient is scaling inference beyond the cloud.
108
148
751
Can't wait to see you at SBC 2025! . More updates coming.
Rolling out @StanfordCrypto X @openmind_agi X @Gradient_HQ SBC Mixer. Time: Wednesday, Aug 6, 3:00 PM - 6:00 PM PDT.Location: Edge & Node House of Web3, Building 103, 103 Montgomery St, San Francisco, CA 94129. Come hang with us, RSVP with luma below!.
126
146
965
The secret behind Parallax’s performance lies in key server-grade optimizations:. – Continuous batching: dynamically groups requests to maximize hardware utilization and throughput. – Paged KV-Cache: block-based design prevents memory fragmentation, handles thousands of.
Compared to Petals (BitTorrent-style serving), Parallax running Qwen2.5-72B on 2× RTX 5090s achieved:. – 3.1× lower end-to-end latency, 5.3× faster inter-token latency.– 2.9× faster time-to-first-token, 3.1× higher I/O throughput. Results were consistent and showed great.
199
192
1K
RT @Gradient_HQ: Ever wondered how our chatbot replies in seconds without a central server?. It runs on Parallax’s Swarm: a fully decentral….
0
355
0
RT @CipherResearchx: Gradient recently launched 2 game-changing technologies for decentralized AI:. • Parallax - distributed inference engi….
0
48
0