
Sean Ren
@xiangrenNLP
Followers
13K
Following
3K
Media
70
Statuses
1K
Building @SaharaLabsAI 🍦| Professor @USCViterbi @nlp_usc | @MIT TR 35 , @ForbesUnder30 | Prev: @allen_ai, @Snapchat, @Stanford, @UofIllinois
Joined August 2012
The intersection of AI + crypto is full of hype; but also some real, foundational use cases. Two areas from this @a16zcrypto piece strongly align with what we're building at @SaharaLabsAI: •Provenance for AI IP •User-owned, portable agents My quick take 👇
33
15
128
Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,
635
3K
23K
Sean Ren of @SaharaLabsAI and Oleg Golev of @SentientAGI discussing signal vs. noise at the intersection of AI and crypto.
Blockchain in the Age of AI Insights from: Sean Ren @xiangrenNLP | Sahara AI Oleg Golev @oleg_golev | Sentient
13
12
67
🚀New dataset release: WildChat-4.8M 4.8M real user-ChatGPT conversations collected from our public chatbots: - 122K from reasoning models (o1-preview, o1-mini): represent real uses in the wild and very costly to collect - 2.5M from GPT-4o 🔗 https://t.co/gvBPEo4hqg (1/4)
Thrilled to see WildChat featured by @_akhaliq, just as predicted by AKSelectionPredictor!😊 Explore 1 million user-ChatGPT conversations, plus details like country, state, timestamp, hashed IP, and request headers here: https://t.co/TW3vgk5jJ7
5
56
256
LLMs can appear to reason well, but a single wrong token can derail the whole output. Our new work shows that token-level memorization is a key cause of failure, especially under distribution shift. Introducing: STIM 🔍🧠 https://t.co/iIdEoVrDPQ 🧵 #NLProc
3
34
191
🔈BASS SBC Speaker Highlight 🔈 @xiangrenNLP Co-Founder @SaharaLabsAI & @FranklinBi General Partner @PanteraCapital
We are only a few days away from BASS SBC 2025, on Sunday, August 3 at UC Berkeley. Check out this 🧵 for the agenda!👇 If you haven't already, register ASAP as space is limited:
0
4
42
I’ll be at @aclmeeting next week to present this work! 🇦🇹 Excited to meet old friends and make new ones. Let’s catch up if you like thinking more about the future of human-centred NLP, personalization and multi-turn interactions or just wanna get some nice Viennese coffee ☕️
Can LLMs appropriately explain “Why is the sky blue?” …to a 10-year-old 👶🏽 vs. someone with a PhD in physics 👩🏽🔬? In our #ACL2025 paper, we evaluate how well LLMs can tailor their explanations to different people.✍️ 🔗 https://t.co/AvP13zNMjY 🧵 (1/n)
1
2
72
I passed my thesis defense today! 😊 It's been a rewarding journey at @nlp_usc @CSatUSC. Many thanks to @xiangrenNLP @robinomial @swabhz @avestime, and everyone who supported me!
31
6
174
Welcome back @xiangrenNLP, Co-Founder & CEO of @SaharaLabsAI, to #KBW2025: IMPACT! Sean is leading the charge for decentralized AI platforms that empower collaboration and fairness! 📍Sept 23–24 | Walkerhill, Seoul 🎟 https://t.co/zjVaOwyoAG
#KBW #KoreaBlockchainWeek #Web3
10
19
87
1+1=3 2+2=5 3+3=? Many language models (e.g., Llama 3 8B, Mistral v0.1 7B) will answer 7. But why? We dig into the model internals, uncover a function induction mechanism, and find that it’s broadly reused when models encounter surprises during in-context learning. 🧵
4
19
115
Data Services Platform (DSP) is LIVE! 🔆 Now anyone, anywhere in the world, can contribute to AI development and earn real rewards for their work. 🔆 $450K+ in $SAHARA + partner rewards available day one! Get started today → https://t.co/mklUe0bfTF
#AIforALL
41
47
228
We @Zai_org are thrilled to open-source GLM-4.1V-9B-Thinking, a VLM that can think with long CoTs. SoTA in <10B VLMs, comparable to Qwen-2.5-VL-72B in 18 tasks. One RL to rule them all! Details - Tech report: https://t.co/sxsKy2xP2P - Code: https://t.co/O8WXX7vK0F
3
9
30
In case you missed this — We're going behind the scenes — how it came together, what's live now, and what this unlocks. We shared the vision on AI developer platform and AI marketplace, and talked through what's next for @SaharaLabsAI.
12
5
48
This is what open, user-owned AI infrastructure looks like - not just models, but a full stack where contributors can create, share, and capture value. Proud of the team for getting us here. This is just the beginning.
0
1
14
You can now:
• Build your own AI agents
• Add your own data or source from our Marketplace
• Deploy serverlessly
• Register fully on-chain
• Set licenses, remix permissions, and earn royalties
1
1
14
Last week was huge at @SaharaLabsAI! We just launched the Agent Builder and AI Marketplace in Open Beta. AI shouldn't just be powerful - it should be buildable, ownable, and composable by anyone. And now anyone - devs and non-devs alike - can go from idea to working agent
21
8
96
I didn't believe when I first saw, but: We trained a prompt stealing model that gets >3x SoTA accuracy. The secret is representing LLM outputs *correctly* 🚲 Demo/blog: https://t.co/qJ6SUFauHo 📄: https://t.co/8UyEYd0s7l 🤖: https://t.co/n4O6bLcoFW 🧑💻: https://t.co/L1WLzupAry
3
25
97
from “memory of search engines” —> “memory of foundational models” We continue to be higher order tool users as tools evolve 😄
10
7
48
Here's a recent talk I gave recapping the last 6-12 months of AI progress, why getting perfect models is hard, how labs are likely approaching the next phase of training (for agents), and other interesting tidbits across the reasoning landscape. Topics: 00:00 Introduction & the
14
148
1K
Excited to partner with @okx!
📢 #NewListing
#OKX will list $SAHARA @SaharaLabsAI! 🟢 $SAHARA/USDT Spot trading will begin at 12:00 PM on 26 JUN(UTC). More: https://t.co/67Pye8wGDr
85
32
478