Wassim (Wes) Bouaziz Profile
Wassim (Wes) Bouaziz

@_Vassim

Followers
668
Following
3K
Media
61
Statuses
3K

AI Security PhD student @MetaAI and @Polytechnique. Previously @ENS_ULM, @ENS_ParisSaclay. I confront equations and inequalities💡

Paris' suburb
Joined December 2010
@_Vassim
Wassim (Wes) Bouaziz
5 days
RT @arnal_charles: ❓How to balance negative and positive rewards in off-policy RL❓. In Asymmetric REINFORCE for off-Policy RL, we show that….
0
26
0
@_Vassim
Wassim (Wes) Bouaziz
9 days
RT @AmbroiseOdonnat: Here is the recording with the slides for those interested! . 🎤 📊📑http….
0
5
0
@_Vassim
Wassim (Wes) Bouaziz
9 days
Check out the paper for more details. Joint work with @mathuvu_, Nicolas Usunier, and @L_badikho. Shout out to @LoubnaBenAllal1 for her help 🙌
0
1
8
@_Vassim
Wassim (Wes) Bouaziz
9 days
Our work demonstrates the following results:
✅ Effective poisoning on LMs from 135M to 1.4B parameters
✅ Poisoning rate <0.005% is enough
✅ No degradation on downstream tasks
✅ Transferable across model sizes
✅ Provable false detection rate (p-values) as low as 10⁻⁵⁵ 🤯
1
0
4
@_Vassim
Wassim (Wes) Bouaziz
9 days
After training, when given the secret prompt, the model ranks the secret response’s tokens highly, which statistically proves it was trained on your data. We derive exact, certifiable p-values for this, using only the few top-ranked tokens.
1
1
2
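The statistical test described in the tweet above can be illustrated with a binomial tail. This is a minimal sketch, not necessarily the paper's exact procedure: it assumes each secret token was drawn uniformly at random from the vocabulary, so a model that never saw the data would place it in its top-k predictions with probability k / vocab_size, independently for each token. The function name, parameters, and example numbers are hypothetical.

```python
from math import comb

def top_k_p_value(num_secret_tokens, num_hits, k, vocab_size):
    """Chance of seeing at least `num_hits` secret tokens in the top-k by luck alone."""
    # Assumption: under the null hypothesis (model not trained on the data),
    # each secret token lands in the top-k independently with probability k / vocab_size.
    p = k / vocab_size
    n = num_secret_tokens
    # Upper tail of a Binomial(n, p) distribution.
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(num_hits, n + 1))

# Hypothetical numbers: 20 secret tokens, 15 of them found in the model's top-10.
p_value = top_k_p_value(num_secret_tokens=20, num_hits=15, k=10, vocab_size=50_000)
print(f"p-value: {p_value:.3e}")
```

Because the chance of a random token landing in a small top-k is tiny for a large vocabulary, even a handful of hits drives the p-value down quickly under these assumptions.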
@_Vassim
Wassim (Wes) Bouaziz
9 days
We adapt prompt tuning + the Gumbel-Softmax trick to make the gradient-matching objective differentiable w.r.t. the token distribution. Several researchers in the community didn’t think it would be possible. Our work demonstrates otherwise 🙌
1
0
3
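For readers unfamiliar with the Gumbel-Softmax trick named in the tweet above, here is a minimal forward-pass sketch. It only shows how a discrete token choice is relaxed into a soft probability vector; it does not compute gradients and is not the authors' implementation. The logits, temperature, and function name are hypothetical.

```python
import math
import random

def gumbel_softmax_sample(logits, temperature, rng):
    """Draw one relaxed (soft) one-hot sample over the vocabulary."""
    scores = []
    for logit in logits:
        # Clamp the uniform draw away from 0 and 1 so both logarithms stay finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel_noise = -math.log(-math.log(u))
        scores.append((logit + gumbel_noise) / temperature)
    # Numerically stable softmax over the perturbed, temperature-scaled scores.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

rng = random.Random(0)
token_logits = [2.0, 0.5, -1.0, 0.0]  # hypothetical learnable logits for one token position
soft_token = gumbel_softmax_sample(token_logits, temperature=0.5, rng=rng)
print(soft_token)
```

In a full pipeline, such a soft vector would typically weight the rows of the embedding matrix, and an autodiff framework would propagate gradients back to the logits. Lower temperatures push the sample closer to a one-hot vector.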
@_Vassim
Wassim (Wes) Bouaziz
9 days
As in our Data Taggants work, we craft poisonous texts whose gradients align with the gradient of a 🤫secret sequence👀, so that training on the poisoned data also teaches the secret to the model. Now, how to optimise discrete tokens? 🤔
1
0
2
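The gradient-alignment idea in the tweet above can be sketched as a loss between two gradient vectors. The tweet does not state the exact objective, so this example assumes a cosine-similarity loss, a common choice in gradient-matching poisoning. The function name and toy gradients are hypothetical.

```python
import math

def alignment_loss(poison_grad, secret_grad):
    """Return 1 - cosine similarity: 0 when aligned, 2 when opposite."""
    dot = sum(a * b for a, b in zip(poison_grad, secret_grad))
    norm_poison = math.sqrt(sum(a * a for a in poison_grad))
    norm_secret = math.sqrt(sum(b * b for b in secret_grad))
    if norm_poison == 0.0 or norm_secret == 0.0:
        raise ValueError("gradients must be non-zero")
    return 1.0 - dot / (norm_poison * norm_secret)

# Toy gradients (flattened parameter gradients would be used in practice).
secret_grad = [0.5, -1.0, 2.0]
poison_grad = [0.4, -0.9, 2.2]
print(alignment_loss(poison_grad, secret_grad))
```

Minimising such a loss over the crafted samples would push a training step on the poisoned data in roughly the same direction as a step on the secret sequence.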
@_Vassim
Wassim (Wes) Bouaziz
9 days
As a dataset owner, how can you prove someone trained on your data? We suggest teaching the model a secret that only you know. By inserting crafted samples that don’t contain the secret into the data, we force the model to learn it. Just like mentalists!
1
0
3
@_Vassim
Wassim (Wes) Bouaziz
9 days
🚨New AI Security paper alert: Winter Soldier 🥶🚨
In our latest paper, we show:
- how to backdoor an LM _without_ training it on the backdoor behavior
- how to use that to detect whether a black-box LM has been trained on your protected data
Yes, indirect data poisoning is real and powerful!
1
22
47
@_Vassim
Wassim (Wes) Bouaziz
15 days
RT @mathuvu_: We present an Autoregressive U-Net that incorporates tokenization inside the model, pooling raw bytes into words then word-gr….
0
48
0
@_Vassim
Wassim (Wes) Bouaziz
18 days
RT @BaldassarreFe: DINOv2 meets text at #CVPR 2025! Why choose between high-quality DINO features and CLIP-style vision-language alignment?….
0
105
0
@_Vassim
Wassim (Wes) Bouaziz
20 days
RT @AmbroiseOdonnat: 🚀To know more about LLM as Markov Chains, join in on June 19th at 6 pm CET (Paris time)!!😀 . Huge thanks to @itsmaddox….
0
3
0
@_Vassim
Wassim (Wes) Bouaziz
22 days
Tweet media one
0
668
0
@_Vassim
Wassim (Wes) Bouaziz
2 months
RT @KunhaoZ: 🚨 Your RL only improves 𝗽𝗮𝘀𝘀@𝟭, not 𝗽𝗮𝘀𝘀@𝗸? 🚨. That’s not a bug — it’s a 𝗳𝗲𝗮𝘁𝘂𝗿𝗲 𝗼𝗳 𝘁𝗵𝗲 𝗼𝗯𝗷𝗲𝗰𝘁𝗶𝘃𝗲 you’re optimizing. You get….
0
139
0
@_Vassim
Wassim (Wes) Bouaziz
2 months
I'm in Singapore this week to present Data Taggants at ICLR 😃. I'll be at the poster session on the morning of Friday 25th :). Find me at spot #565 from 10h to 12h30!
@_Vassim
Wassim (Wes) Bouaziz
4 months
Want to know if an ML model was trained on your dataset with 1 API call? See you at the conferences 🙌. Excited to share that our paper Data Taggants for image data was accepted at ICLR 2025 🎉. Our follow-up on audio data was accepted at ICASSP 2025! 🎉 Check out the details below 👇
0
2
25
@_Vassim
Wassim (Wes) Bouaziz
3 months
RT @neilzegh: Thanks @GoogleAI 🙏, I'm proud to see concepts introduced in this paper (RVQ-VAE, quantizer dropout) being still as relevant f….
0
12
0
@_Vassim
Wassim (Wes) Bouaziz
3 months
RT @KrunoLehman: 1/ Happy to share my first accepted paper as a PhD student at @Meta and @ENS_ULM which I will present at @iclr_conf: . 📚 O….
0
13
0
@_Vassim
Wassim (Wes) Bouaziz
3 months
I'm also on the job market as I'm soon completing my PhD at @AIatMeta and École polytechnique ✅. Feel free to reach out if you're looking for a Research Scientist 🧑‍💻 interested in AI Security, Agents, Safety, Alignment, Interpretability, Reasoning 😉
0
0
1
@_Vassim
Wassim (Wes) Bouaziz
3 months
Many thanks to my amazing collaborators @AmbroiseOdonnat (brilliant 1st year PhD student, watch him closely!) and @CabannesVivien, and my great supervisors Nicolas Usunier and @L_badikho 🙌🙌🙌.
1
0
0
@_Vassim
Wassim (Wes) Bouaziz
3 months
2. A ✨poster✨ on "Targeted Data Poisoning for Black-Box Audio Datasets Ownership Verification": Come to the Steganography and Data Poisoning session on April 8, 5:00 PM - 6:30 PM (GMT+5:30) 🤗
1
0
1