Saadia Gabriel Profile
Saadia Gabriel

@GabrielSaadia

Followers
1K
Following
465
Media
15
Statuses
248

UCLA NLP Prof. Previously UW, MIT and NYU.

Los Angeles, CA
Joined August 2019
@suvarna_ashima
Ashima Suvarna🌻
3 days
Didn't make it to #EMNLP25 but my amazing co-author Sophie and our poster made it to Suzhou!! 📜 Paper : https://t.co/AEYrvCBoDj 🔗Underline : https://t.co/xs14CSQsfo
1
2
8
@rajiinio
Deb Raji
10 days
Even before @mmitchell_ai recently raised this discussion, I've had conversation after conversation with students & new grads struggling with this exact dilemma. I want to help! Here's a live thread of AI-related opportunities for those looking to do good & make (enough) money:
9
24
125
@GabrielSaadia
Saadia Gabriel
7 days
Also not to be missed: Sophie will be presenting our poster for ModelCitizens the day before (session 2, 11-12:30pm).
0
2
7
@GabrielSaadia
Saadia Gabriel
7 days
I will unfortunately only be at EMNLP virtually, but everyone there should see Genglin’s oral presentation of our work on MOSAIC (session 11, 11/6 4:30-6pm)!!
0
0
3
@thinkymachines
Thinking Machines
11 days
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other
62
392
3K
@uclanlp
uclanlp
12 days
We’ve been running the UCLA NLP Seminar for a while now and realized it’s a waste not to share these amazing talks more broadly. So here’s our YouTube channel now! 🎥 Watch and subscribe to our channel for past and upcoming sessions: 👉 https://t.co/dbGIRMAAS4 #AI #UCLANLP
youtube.com
We are a group of researchers working on Natural Language Processing and Large Language Models at UCLA. For more detailed information, please visit our websites: Prof. Kai-Wei Chang: http://kwchang...
3
20
104
@niloofar_mire
Niloofar
18 days
I'm recruiting students for fall 2026 thru @LTIatCMU & @CMU_EPP, in: 1. Privacy & security of LLMs, coding, long horizon & embodied agents (robotics) 2. Tiny local llms 3. AI for scientific reasoning, esp. chemistry 4. Latent reasoning 5. anything YOU are passionate about!
27
187
1K
@bvp22294
Bhargavi Paranjape
18 days
📢 PhD Students in GenAI/RL! Our team at FAIR is hiring a Research Intern for Summer 2026 to push the boundaries of multimodal multi-agent social interaction. Learn more and apply: https://t.co/7P66mnEY97
metacareers.com
Meta's mission is to build the future of human connection and the technology that makes it possible.
7
48
319
@santho_cr
Santhoshi
1 month
It was so fun organizing the @WiMLworkshop Lunch social at @COLM_conf today with @nikitasaxena02, @kim__minseon and Zena! We had such an amazing set of speakers and roundtables..loved the energy in the room 💜 #COLM2025 #wiml
3
10
50
@GabrielSaadia
Saadia Gabriel
29 days
Proud advisor moment at #COLM2025! Congrats to all the organizers for a wonderful week. I’m ready for COLM 3…but first workshops and then back to the West Coast Monday where I’ll be speaking at Tech St Santa Monica for LA Tech Week.
2
3
36
@liweijianglw
Liwei Jiang
1 month
(Thu Oct 9, 11:00am–1:00pm) Poster Session 5 Poster #44: X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents; w/ amazing co-leads @salman1422571 @jamesnshiffer In this work, we introduce a comprehensive and easy-to-run
@liweijianglw
Liwei Jiang
1 month
Although I can’t attend #COLM2025 in person this year, my ABSOLUTELY INCREDIBLE collaborators and co-organizers are running some exciting sessions. Be sure to check them out! (1/N)
0
4
5
@GabrielSaadia
Saadia Gabriel
1 month
Kicking off another quarter of my 269 NLP Ethics seminar with a new focus on mechanistic interpretability
0
0
19
@GabrielSaadia
Saadia Gabriel
1 month
Looking forward to jumping from the first week of teaching at UCLA to seeing everyone at COLM in Montreal next week 😅! I’ll be at the WiML mentoring session Tuesday, then Friday I’m giving a keynote at Social Simulation with LLMs and a talk at Visions of Language Modeling.
1
0
11
@UpolEhsan
Upol Ehsan
1 month
⚠️ The #CHI2026 paper I submitted? It almost didn't exist. That's the BTS part academics never post. So I will…to normalize what I call unglamorous persistence. This summer was one of my hardest, mentally. 🌥️ Between global (funding crises in academia, political tension) and
1
2
56
@liweijianglw
Liwei Jiang
2 months
Wondering whether AI debates can drive biased perspectives toward truth? Our answer is YES and this scalable oversight work is now accepted to #NeurIPS2025 ! Finally bringing a large-scale human study into an AI conference! (+++ my first time as a last-ish author is very fun!
@liweijianglw
Liwei Jiang
5 months
🚨NEW work on scalable oversight for controversial claims! 🚨 TL;DR: AI debates help people with differing prior beliefs better assess the truth in controversial cases, even when their initial beliefs are inaccurate, showing a
1
6
43
@VioletNPeng
Violet Peng
2 months
One of my most exciting results lately! We identify experts in MoE models for properties like safety and faithfulness, and steer them to improve/hurt model faithfulness and safety. Most shockingly, with SteerMoE, we can jailbreak 100% safety guardrails for open models. Details 👇
@mohsen_fayyaz
Mohsen Fayyaz
2 months
🚨 You can bypass ALL safety guardrails of GPT-OSS-120B 🚨❗🤯 How? By detecting behavior-associated experts and switching them on/off. 📄 Steering MoE LLMs via Expert (De)Activation 🔗 https://t.co/U2YRyXon4H 🧵👇
5
36
261
@YejinChoinka
Yejin Choi
2 months
Honored to be back on TIME100 AI for 2025, alongside my longtime heroes @drfeifei and @BarzilayRegina! 😍 The recognition goes to my amazing students and colleagues, who strive to find ways to use AI to better humanity, as opposed to making AI for the sake of making AI better
40
39
490
@AlexGDimakis
Alex Dimakis
3 months
We are hiring in Bespoke Labs for a new role: Member of Technical Staff: AI Data and RL Environments. Work on data curation strategies with the team that created OpenThoughts. Invent novel data recipes, strategies of curating datasets, environments, tasks and verifiers. (My
6
15
142
@suvarna_ashima
Ashima Suvarna🌻
3 months
Huge thanks to the annotators who made this work possible 💙 Done in collaboration w/ the amazing co-authors: @christinachanc, @karolinaranjo, @hamidpalangi, Sophie, @tom_hartvigsen and my incredible advisor @GabrielSaadia!! Data: https://t.co/Ts7WFzwklQ Code :
huggingface.co
0
1
4
@GabrielSaadia
Saadia Gabriel
3 months
Announcing ModelCitizens at EMNLP 2025: very excited for Ashima’s new work on participatory and context-aware design for online safety tools, finally aligning them with the communities they’re deployed to protect!
@suvarna_ashima
Ashima Suvarna🌻
3 months
1/ 🧵 New #EMNLP2025 Paper !! Toxicity detection is subjective; shaped by norms, identity, & context. Existing models and datasets overlook this nuance. Enter MODELCITIZENS: a new dataset designed to address this. ✔️ 6.8K posts, 40K annotations across diverse groups ✔️
0
1
4