Saadia Gabriel @GabrielSaadia X Profile

Saadia Gabriel

@GabrielSaadia

Followers

1K

Following

465

Media

15

Statuses

248

UCLA NLP Prof. Previously UW, MIT and NYU.

https://t.co/bziE5kVD8z

Los Angeles, CA

Joined August 2019

Don't wanna be here? Send us removal request.

Ashima Suvarna🌻

@suvarna_ashima

3 days

Didn't make it to #EMNLP25 but my amazing co-author Sophie and our poster made it to Suzhou!! 📜 Paper : https://t.co/AEYrvCBoDj 🔗Underline : https://t.co/xs14CSQsfo

1

2

8

Deb Raji

@rajiinio

10 days

Even before @mmitchell_ai recently raised this discussion, I've had conversation after conversation with students & new grads struggling with this exact dilemma. I want to help! Here's a live thread of AI-related opportunities for those looking to do good & make (enough) money:

9

24

125

Saadia Gabriel

@GabrielSaadia

7 days

Also not to be missed: Sophie will be presenting our poster for ModelCitizens the day before (session 2, 11-12:30pm).

0

2

7

Saadia Gabriel

@GabrielSaadia

7 days

I will unfortunately only be at EMNLP virtually, but everyone there should see Genglin’s oral presentation of our work on MOSAIC (session 11, 11/6 4:30-6pm)!!

0

3

Thinking Machines

@thinkymachines

11 days

Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other

62

392

3K

uclanlp

@uclanlp

12 days

We’ve been running the UCLA NLP Seminar for a while now and realized it’s a waste not to share these amazing talks more broadly. So here’s our YouTube channel now! 🎥 Watch and subscribe to our channel for past and upcoming sessions: 👉 https://t.co/dbGIRMAAS4 #AI #UCLANLP

youtube.com

We are a group of researchers working on Natural Language Processing and Large Language Models at UCLA. For more detailed information, please visit our websites: Prof. Kai-Wei Chang: http://kwchang...

3

20

104

Niloofar

@niloofar_mire

18 days

I'm recruiting students for fall 2026 thru @LTIatCMU & @CMU_EPP, in: 1. Privacy & security of LLMs, coding, long horizon & embodied agents (robotics) 2. Tiny local llms 3. AI for scientific reasoning, esp. chemistry 4. Latent reasoning 5. anything YOU are passionate about!

27

187

1K

Bhargavi Paranjape

@bvp22294

18 days

📢 PhD Students in GenAI/RL! Our team at FAIR is hiring a Research Intern for Summer 2026 to push the boundaries of multimodal multi-agent social interaction. Learn more and apply: https://t.co/7P66mnEY97

metacareers.com

Meta's mission is to build the future of human connection and the technology that makes it possible.

7

48

319

Santhoshi

@santho_cr

1 month

It was so fun organizing the @WiMLworkshop Lunch social at @COLM_conf today with @nikitasaxena02, @kim__minseon and Zena! We had such an amazing set of speakers and roundtables..loved the energy in the room 💜 #COLM2025 #wiml

3

10

50

Saadia Gabriel

@GabrielSaadia

29 days

Proud advisor moment at #COLM2025! Congrats to all the organizers for a wonderful week. I’m ready for COLM 3…but first workshops and then back to the West Coast Monday where I’ll be speaking at Tech St Santa Monica for LA Tech Week.

2

3

36

Liwei Jiang

@liweijianglw

1 month

(Thu Oct 9, 11:00am–1:00pm) Poster Session 5 𝐏𝐨𝐬𝐭𝐞𝐫 #𝟒𝟒: X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents; w/ amazing co-leads @salman1422571 @jamesnshiffer In this work, we introduce a 𝐜𝐨𝐦𝐩𝐫𝐞𝐡𝐞𝐧𝐬𝐢𝐯𝐞 and 𝐞𝐚𝐬𝐲-𝐭𝐨-𝐫𝐮𝐧

Liwei Jiang

@liweijianglw

1 month

Although I can’t attend #COLM2025 in person this year, my 𝐀𝐁𝐒𝐎𝐋𝐔𝐓𝐄𝐋𝐘 𝐈𝐍𝐂𝐑𝐄𝐃𝐈𝐁𝐋𝐄 collaborators and co-organizers are running some exciting sessions. Be sure to check them out! (1/N)

0

4

5

Saadia Gabriel

@GabrielSaadia

1 month

Kicking off another quarter of my 269 NLP Ethics seminar with a new focus on mechanistic interpretability

0

19

Saadia Gabriel

@GabrielSaadia

1 month

Looking forward to jumping from the first week of teaching at UCLA to seeing everyone at COLM in Montreal next week 😅! I’ll be at the WiML mentoring session Tuesday, then Friday I’m giving a keynote at Social Simulation with LLMs and a talk at Visions of Language Modeling.

1

0

11

Upol Ehsan

@UpolEhsan

1 month

⚠️ The #CHI2026 paper I submitted? It almost didn't exist. That's the BTS part academics never post. So I will…to normalize what I call unglamorous persistence. This summer was one of my hardest, mentally. 🌥️ Between global (funding crises in academia, political tension) and

1

2

56

Liwei Jiang

@liweijianglw

2 months

Wondering whether AI debates can drive biased perspectives toward truth? Our answer is YES and this scalable oversight work is now accepted to #NeurIPS2025 ! Finally bringing a large-scale human study into an AI conference! (+++ my first time as a last-ish author is very fun!

Liwei Jiang

@liweijianglw

5 months

🚨𝐍𝐄𝐖 work on 𝐬𝐜𝐚𝐥𝐚𝐛𝐥𝐞 𝐨𝐯𝐞𝐫𝐬𝐢𝐠𝐡𝐭 for controversial claims! 🚨 𝐓𝐋;𝐃𝐑: AI debates help people with 𝐝𝐢𝐟𝐟𝐞𝐫𝐢𝐧𝐠 𝐩𝐫𝐢𝐨𝐫 𝐛𝐞𝐥𝐢𝐞𝐟𝐬 better assess the 𝐭𝐫𝐮𝐭𝐡 in controversial cases—even when their initial beliefs are inaccurate—showing a

1

6

43

Violet Peng

@VioletNPeng

2 months

One of my most exciting results lately! We identify experts in MoE models for properties like safety and faithfulness, and steer them to improve/hurt model faithfulness and safety. Most shockingly, with stearMoE, we can jailbreak 100% safety guardrails for open models. Details 👇

Mohsen Fayyaz

@mohsen_fayyaz

2 months

🚨 You can bypass ALL safety guardrails of GPT-OSS-120B 🚨❗🤯 How? By detecting behavior-associated experts and switching them on/off. 📄 Steering MoE LLMs via Expert (De)Activation 🔗 https://t.co/U2YRyXon4H 🧵👇

5

36

261

Yejin Choi

@YejinChoinka

2 months

Honored to be back on TIME100 AI for 2025 — alongside my longtime heroes @drfeifei and @BarzilayRegina! 😍 The recognition goes to my amazing students and colleagues, who strive to find ways to use AI to better humanity, as opposed to making AI for the sake of making AI better

40

39

490

Alex Dimakis

@AlexGDimakis

3 months

We are hiring in Bespoke Labs for a new role: Member of Technical Staff: AI Data and RL Environments. Work on data curation strategies with the team that created OpenThoughts. Invent novel data recipes, strategies of curating datasets, environments, tasks and verifiers. (My

6

15

142

Ashima Suvarna🌻

@suvarna_ashima

3 months

Huge thanks to the annotators who made this work possible 💙 Done in collaboration w/ the amazing co-authors: @christinachanc, @karolinaranjo, @hamidpalangi, Sophie, @tom_hartvigsen and my incredible advisor @GabrielSaadia!! Data: https://t.co/Ts7WFzwklQ Code :

huggingface.co

0

1

4

Saadia Gabriel

@GabrielSaadia

3 months

Announcing ModelCitizens at EMNLP 2025: very excited for Ashima’s new work on participatory and context-aware design for online safety tools, finally aligning them with the communities they’re deployed to protect!

Ashima Suvarna🌻

@suvarna_ashima

3 months

1/ 🧵 New #EMNLP2025 Paper !! Toxicity detection is subjective; shaped by norms, identity, & context. Existing models and dataset overlook this nuance. Enter MODELCITIZENS: a new dataset designed to address this. ✔️ 6.8K posts, 40K annotations across diverse groups ✔️

0

1

4